Interactive Mining Topic Evolutionary Patterns from Internet forums
In many real-world topic detection tasks, the detection process is interactive: users are likely to intervene in the inference process by expressing their preferences. We propose an algorithm, iOLDA, and a software framework for interactive topic evolution pattern detection based on Latent Dirichlet Allocation (LDA). To suppress topics that are uninteresting or unrelated, it allows users to add supervised information by adjusting the posterior topic-word distributions at the end of each iteration, which may influence the inference process of the next iteration. Experiments are conducted on both English and Chinese corpora, and the results show that the extracted topics capture meaningful themes in the data and that the proposed interaction policies can help to discover better topics.
data mining; probabilistic topic models; topic evolutionary patterns
Bin Zhou; Cui Kai; Yan Jia; Jing Li
School of Computer, National University of Defense Technology, Changsha, Hunan 410073, China; Section of Information, CNCERT/CC, Beijing 100029, China
International conference
Shanghai
English
76-81
2010-06-22 (date first posted online on the Wanfang platform; not the publication date of the paper)