A Chinese Web Documents Clustering Method Based on the Suffix Tree
An improved Chinese documents clustering method based on suffix tree is reported in this paper.The foundation of the improved method is the keyword,at the same time,joined the POS (part of speech)judgment.The STC (Suffix Tree Clustering) is introduced first,and then,its improved method including its realization details are given,accordingly,a cluster evaluation method is used to measure the performance of the proposed method.The experiments demonstrate that the proposed method can achieve better results than the STC does.
STC Chinese documents clustering keywords weight POS
Chiwen Wu
School of Information Technology,Jiangnan University,Wuxi Jiangsu China
国际会议
大连
英文
467-472
2008-07-27(万方平台首次上网日期,不代表论文的发表时间)