会议专题

Bursty Topic Detection on Twitter

Twitter is a user-generated content system that allows its users to share short text messages, called tweets. Every day millions of users post an extraordinarily large number of small messages, it is almost impossible for people to read all of them. In this paper, we propose a novel topic detection technique that permits to retrieve in real-time the burstiest topics expressed by the community. It first identifies bursty features by time series analysis over the twitter stream. The computation of weight of features takes account of various parameters including position and the number of replies. Then these features are grouped into topics by unsupervised learning clustering algorithm. A preliminary evaluation is carried out on an implementation of this technique that shows promising results.

Bursty topic detection Twitter analysis Time series analysis

YanYan Du

School of computer University of Wuhan Wuhan, Hubei Province, China

国际会议

2010 International Conference on Circuit and Signal Processing(2010年电路与信号处理国际会议 ICCSP 2010)

上海

英文

174-177

2010-12-25(万方平台首次上网日期,不代表论文的发表时间)