Semi-supervised Classification Algorithm Based on the KNN
KNN algorithm is a classification algorithm based on examples. For a test documentation, we need to calculate the similarity with each text of the training sample focus, the computation complexity is very high. According to this problem, this paper puts forward a method based on the EM -KNN semi-supervised classification algorithm. Firstly, the algorithm to cluster the training set, calculate the center of each category, then combine the center of each category and the clustering the text to form new training set. Finally train the new training set with classical KNN algorithm. Experimental results show that computational complexity can be reduced largely and the performance of the classifier can be improved by this algorithm.
Semi-supervised EM KNN Classification Clustering
Yawei Chang Houquan Liu
School of Computer Science and Technology China University of Mining & Technology Xuzhou,China
国际会议
西安
英文
9-12
2011-05-13(万方平台首次上网日期,不代表论文的发表时间)