Conference Paper

A Text Clustering Scheme based on Semantic Description

This paper describes a technique called descriptive text clustering (DTC) for large collections of short- and medium-length text documents. The technique has three steps: identifying clusters of related documents, selecting salient phrases relevant to those clusters, and reallocating the documents that match the selected phrases to form the final document groups. Its advantages include more comprehensive cluster labels and a clearer relationship between each cluster label and the cluster's content. The results of a computational experiment show that DTC slightly increases clustering quality.
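The three steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the cosine-based k-means with a fixed initialisation, the use of word bigrams as candidate phrases, the scoring rule (in-cluster document frequency minus out-of-cluster document frequency), and all function names and sample documents are assumptions made purely for illustration.

```python
import math
from collections import Counter


def tokens(text):
    return text.lower().split()


def bigrams(text):
    # Candidate phrases: the set of adjacent word pairs in a document (assumption).
    words = tokens(text)
    return {" ".join(pair) for pair in zip(words, words[1:])}


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def kmeans(vectors, k, iterations=10):
    # Step 1: group related documents. The first k documents seed the
    # centroids so that the sketch is deterministic (assumption).
    centroids = [Counter(v) for v in vectors[:k]]
    assignment = []
    for _ in range(iterations):
        assignment = [max(range(k), key=lambda c: cosine(v, centroids[c])) for v in vectors]
        for c in range(k):
            members = [v for v, a in zip(vectors, assignment) if a == c]
            if members:
                # A summed vector works as a centroid because cosine ignores scale.
                centroids[c] = sum(members, Counter())
    return assignment


def label_phrase(docs, assignment, cluster):
    # Step 2: pick the bigram that is frequent inside the cluster and rare outside it.
    inside, outside = Counter(), Counter()
    for doc, a in zip(docs, assignment):
        (inside if a == cluster else outside).update(bigrams(doc))
    return max(inside, key=lambda p: inside[p] - outside[p], default=None)


def descriptive_clustering(docs, k):
    vectors = [Counter(tokens(d)) for d in docs]
    assignment = kmeans(vectors, k)
    labels = [label_phrase(docs, assignment, c) for c in range(k)]
    # Step 3: reallocate documents to every label phrase they contain.
    groups = {label: [] for label in labels if label}
    for i, doc in enumerate(docs):
        for label in groups:
            if label in bigrams(doc):
                groups[label].append(i)
    return groups


# Hypothetical sample collection; the first two documents seed the two clusters.
DOCS = [
    "text clustering groups similar documents",
    "web mining extracts patterns from web logs",
    "k-means text clustering of short documents",
    "web mining of user logs",
    "text clustering with cluster labels",
    "web mining for web applications",
]

groups = descriptive_clustering(DOCS, 2)
```

Because the final groups are defined by phrase matching rather than by centroid distance, each group's label is by construction a phrase that every member contains. In this sketch a document may match more than one label, and a document that matches none is left out of the result; how the paper handles those cases is not stated in the abstract.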

Descriptive Text Clustering (DTC); Cluster Labels; Pattern Phrases; K-means Clustering; Dominant Topics

Ming Tingtang, Huo Junwei

Network Information Center, Henan University, Kaifeng, China

International conference

2010 Third Pacific-Asia Conference on Web Mining and Web-based Application (WMWA 2010)

Guilin

English

142-149

2010-11-17 (date first made available online on the Wanfang platform; this is not the paper's publication date)