Blogger clustering by utilizing link information
Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the bloggers browse behavior, which reflects the users interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective.
Blogger cluster PLSA Model K-medoids algorithm
Lu Lu Fuxi Zhu
School of Computer,Wuhan University,Wuhan, China
国际会议
厦门
英文
267-270
2010-10-29(万方平台首次上网日期,不代表论文的发表时间)