A novel Text Classification Based on Mahalanobis distance
In text mining Held, The KNN (K Nearest Neighbors) is one of the oldest and simplest methods of text classification. But it is known to be sensitive to the distance (or similarity) function used in classifying a test instance, this disadvantage can cause low classification accuracy and limit the KNN classifiers utilization in text classification in text mining. In this paper, we introduce Mahalanobis distance in text classification area, and proposed an algorithm (MDKNN) base on this theory. Experiment show that our method has comparable or better performance than KNN Classifier and Naive Bayes classifier in text classification.
Mahalanobis distance KNN Classifier Text Classification Chinese
Suli Zhang Xin Pan
School of Electrical & Information Technology Changchun Institute of Technology Changchun,China
国际会议
上海
英文
156-158
2011-03-11(万方平台首次上网日期,不代表论文的发表时间)