Feature Selection through Optimization of k-Nearest Neighbor Matching Gain
Many problems in Information processing involve some form of dimensionality reduction. In this paper, we propose a new model for feature evaluation and selection in unsupervised learning scenarios. The model makes no special assumptions on the nature of the data set. For each of the data set, the original features induce a ranking list of items in its k nearest neighbors. The evaluation criterion favors reduced features that result in the most consistent to these ranked lists. And an efficiently local descent search based on the model is adopted to select the reduced features. Our experiments with several data sets demonstrate that the proposed algorithm is able to detect completely irrelevant features and to remove some additional features without significantly hurting the performance of the clustering algorithm.
feature selection unsupervised learning k-nearest neighbor
Yihui Luo Shuchu Xiong
Department of Information Hunan University of Commerce Changsha, China
国际会议
长沙
英文
1479-1482
2010-05-11(万方平台首次上网日期,不代表论文的发表时间)