Popular Topic Detection in Chinese Micro-Blog Based on the Modified LDA Model
Micro-blog has become a symbol of the novel social media, and because of its rapid development in such a short time,many research researchers are full of enthusiasm about it.We take use of Latent Dirichlet Allocation (LDA) Model which has excellent dimension reduction capability and can excavate latent semantic from texts to discover popular topics.We improve the original LDA model to FSC-LDA model by combining the text clustering methods and feature selection methods, which can identify the number of topics adaptively.FSC-LDA model can keep short micro-biog texts features better, and make the result more stable.The result of the experiments on real Chinese microblog text dataset shows that FSC-LDA model can perform well on the custom evaluation and find more accurate popular topics.
popular topics detection text clustering latent dirichlet allocation model FSC-LDA
Yuzhong Chen Wanhua Li Wenzhong Guo Kun Guo
College of Mathematics and Computer Science Fujian Key Laboratory of Network Computing and Intelligent Information Processing Fuzhou University FuZhou, China
国际会议
济南
英文
37-42
2015-09-11(万方平台首次上网日期,不代表论文的发表时间)