会议专题

Multi-level topic detection algorithm for Netnews Specials

This paper investigates the topic detection method in Netnews Specials Detection (NSD). We found that when the traditional clustering algorithms are used in NSD, the same topic is usually split into several pieces and the result is not-satisfying. So a new algorithm is proposed which uses a multilevel model, better suited for NSD. Firstly, such algorithm elevates the accuracy of single-layer clustering by introducing hot search words, a selective dictionary, and an advanced weight formula. Secondly, the multiple-level model not only avoids the problem of topic over-split but also establishes a structure for Netnews Specials, which lays the foundation for quick viewing, positioning and retrieval. Experimental results show that the algorithm in the real test corpus have high accuracy, doing a better job than the traditional clustering method.

Netnews Specials Vector Space Model (VSM) topic detection model Natural Language Processing (NLP)

Yu Peng ZhiQing Lin Bo Xiao Chuang Zhang

PRIS laboratory Beijing University of Posts and Telecommunications BUPT Beijing, China

国际会议

The 13th IEEE Joint International Computer Science and Information Technology Conference(2011年第13届IEEE联合国际计算机科学与信息技术会议 JICSIT 2011)

重庆

英文

1545-1549

2011-08-20(万方平台首次上网日期,不代表论文的发表时间)