会议专题

A Chinese Web Documents Clustering Method Based on the Suffix Tree

An improved Chinese documents clustering method based on suffix tree is reported in this paper.The foundation of the improved method is the keyword,at the same time,joined the POS (part of speech)judgment.The STC (Suffix Tree Clustering) is introduced first,and then,its improved method including its realization details are given,accordingly,a cluster evaluation method is used to measure the performance of the proposed method.The experiments demonstrate that the proposed method can achieve better results than the STC does.

STC Chinese documents clustering keywords weight POS

Chiwen Wu

School of Information Technology,Jiangnan University,Wuxi Jiangsu China

国际会议

2008年国际电子商务、工程及科学领域的分布式计算和应用学术研讨会(2008 International Symposium on Distributed Computing and Applications for Business Engineering and Science)

大连

英文

467-472

2008-07-27(万方平台首次上网日期,不代表论文的发表时间)