A Text Clustering Scheme based on Semantic Description
This paper describes a technique called descriptive text clustering for large collections of short and medium length text documents. It consists of identification of related document clusters, selection of salient phrases relevant to these clusters and reallocation of documents matching the selected phrases to form final document groups. The advantages of this technique include more comprehensive cluster labels and clearer relationship between cluster labels and their content. The results of a computational experiment show that DTC slightly increases clustering quality.
Descriptive Text Clustering (DTC) Cluster Labels Pattern Phrases K-means Clustering Dominant Topics
Ming Tingtang Huo Junwei
Network Information Center Henan University Kaifeng, China
国际会议
桂林
英文
142-149
2010-11-17(万方平台首次上网日期,不代表论文的发表时间)