会议专题

Blogger clustering by utilizing link information

Blogs are a new form of internet phenomenon and a vast ever-increasing information resource, which are dated unedited, highly opinionated personal online commentary including all kinds of hyperlinks such as citation link, comment link, blogroll link. These links can be viewed as the bloggers browse behavior, which reflects the users interest to a certain extent. So we construct a blogger-post matrix, link analysis is considered in calculation of the entry of the matrix. With usage of probability latent semantic analysis, the conditional probability of latent variable Z to post P is transformed the the conditional probability of latent variable Z to post B, then the transformed results are used in similarity calculation. The k medoids algorithm is adopted to further improve clustering result. Experiment results have shown that this new algorithm is effective.

Blogger cluster PLSA Model K-medoids algorithm

Lu Lu Fuxi Zhu

School of Computer,Wuhan University,Wuhan, China

国际会议

2010 IEEE International Conference on Intelligent Computing and Intelligent Systems(2010 IEEE 智能计算与智能系统国际会议 ICIS 2010)

厦门

英文

267-270

2010-10-29(万方平台首次上网日期,不代表论文的发表时间)