会议专题

A novel Text Classification Based on Mahalanobis distance

In text mining Held, The KNN (K Nearest Neighbors) is one of the oldest and simplest methods of text classification. But it is known to be sensitive to the distance (or similarity) function used in classifying a test instance, this disadvantage can cause low classification accuracy and limit the KNN classifiers utilization in text classification in text mining. In this paper, we introduce Mahalanobis distance in text classification area, and proposed an algorithm (MDKNN) base on this theory. Experiment show that our method has comparable or better performance than KNN Classifier and Naive Bayes classifier in text classification.

Mahalanobis distance KNN Classifier Text Classification Chinese

Suli Zhang Xin Pan

School of Electrical & Information Technology Changchun Institute of Technology Changchun,China

国际会议

2011 3rd IEEE International Conference on Computer Research and Development(ICCRD 2011)(2011第三届计算机研究与发展国际会议)

上海

英文

156-158

2011-03-11(万方平台首次上网日期,不代表论文的发表时间)