A Clustering Algorithm of No-Word-Segmentation for Chinese Search Engine Results
With the amount of information on the Internet increasing dramatically, people usually rely on search engines to find the information they need. Clustering search engine results is an effective way to help people pick out the information they need from the result list. This paper presents a clustering algorithm of no-word-segmentation for Chinese search engine results (CANWS). The algorithm first preprocesses the search engine results, then computes the similarities between results based on the substrings they share. Finally, it clusters the results based on the similarity matrix. The paper also tests and analyzes the algorithm's performance through experiments.
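The three stages named in the abstract (preprocessing, substring-based similarity, clustering on the similarity matrix) can be illustrated with a minimal sketch. This is not the CANWS algorithm itself, whose details the abstract does not give. Everything here is an assumption made for illustration: the function names, the choice of longest-common-substring length normalized by the shorter text as the similarity measure, the threshold of 0.5, and the use of single-link grouping over the thresholded matrix.

```python
def preprocess(text: str) -> str:
    # Assumed preprocessing: drop punctuation and whitespace, keep letters,
    # digits, and CJK characters (str.isalnum is True for CJK characters).
    return "".join(ch for ch in text if ch.isalnum())


def longest_common_substring_len(a: str, b: str) -> int:
    # Classic dynamic programme over characters, so no word segmentation
    # is needed; prev[j] is the common-suffix length ending at b[j-1].
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def similarity(a: str, b: str) -> float:
    # Assumed measure: shared substring length relative to the shorter text.
    shorter = min(len(a), len(b))
    return longest_common_substring_len(a, b) / shorter if shorter else 0.0


def cluster(snippets: list[str], threshold: float = 0.5) -> list[list[int]]:
    cleaned = [preprocess(s) for s in snippets]
    n = len(cleaned)
    # Full pairwise similarity matrix.
    sim = [[similarity(cleaned[i], cleaned[j]) for j in range(n)] for i in range(n)]

    # Assumed clustering step: union-find over pairs above the threshold
    # (single-link grouping), returning clusters as lists of snippet indices.
    parent = list(range(n))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if sim[i][j] >= threshold:
                parent[find(i)] = find(j)

    groups: dict[int, list[int]] = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    results = ["北京大学招生信息", "北京大学招生简章", "苹果手机价格"]
    print(cluster(results))
```

Because the comparison works on raw character sequences, the sketch needs no dictionary or segmenter, which is the property the title refers to. The threshold and the similarity normalization would need tuning against real search results.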
search engine results clustering results similarity algorithm clustering algorithm
Deqing Wang, Hui Zhang, Liping Zhao, Ke Xie
Beihang University, State Key Lab of Software Development Environment, Beijing 100083
International conference
Third International Conference on Semantics, Knowledge, and Grid (SKG 2007), 2007
Xi'an
English
2007-10-29 (date first made available online on the Wanfang platform; not the paper's publication date)