High Quality Algorithm for Chinese Short Messages Text Clustering Based on Semantic
Existing data clustering method lacks considering of latent similar information existing among words,and it leads to unsatisfactory clustering result.Aiming at Chinese short message text clustering,this paper proposes a clustering algorithm based on semantic.It offers Chinese concept,and the measuring methods to calculate the similarity degree about words and Chinese short message text.It completes the clustering of Chinese short messages text through fission downwards and mergence of twos upwards.Experimental results show that this algorithm has better clustering quality than traditional algorithm.
short messages text semantic concept similarity
Fengxia Yang
Department of Computer Science Cangzhou Normal University Cangzhou, China
国际会议
太原
英文
1263-1266
2012-12-08(万方平台首次上网日期,不代表论文的发表时间)