Online hot topic detection from web news archive in short terms
In the age of information explosion,vast amount of information is now available on the internet.Early warning of breaking events is becoming a popular subject to study.Timeliness is the one of the most important factors to be considered in this subject.However,traditional topic detection approaches are always not so effective for the detection of emerging topics which concentrate all the news stories in a short term that make up the breaking topics.Therefore,a specific approach that applies to topic detection in short terms is required for this subject.In this paper,we utilize a temporal distance factor to measure the similarity between news stories and topics and then propose a novel approach based on it that could perform better in short-term topic detection.In the meantime,the aging theory was adopted in our scheme to build the life cycle model of events,from which we can get access to the hotness ranking at any time and lay a foundation for the research on early warning of breaking topics in the future.The following experiments indicate that our approach is effective and the life cycle model of events can basically conform to the reality.
component hot topic detection short terms aging theory life cycle model
RuiGuo Yu ManKun Zhao Peng Chang MuWen He
School of Computer Science and Technology Tianjin University, Tianjin, P. R. China Information and Network Center Tianjin University, Tianjin, P. R. China
国际会议
厦门
英文
931-935
2014-08-19(万方平台首次上网日期,不代表论文的发表时间)