Feature Selection through Optimization of k-Nearest Neighbor Matching Gain

摘要：

Many problems in Information processing involve some form of dimensionality reduction. In this paper, we propose a new model for feature evaluation and selection in unsupervised learning scenarios. The model makes no special assumptions on the nature of the data set. For each of the data set, the original features induce a ranking list of items in its k nearest neighbors. The evaluation criterion favors reduced features that result in the most consistent to these ranked lists. And an efficiently local descent search based on the model is adopted to select the reduced features. Our experiments with several data sets demonstrate that the proposed algorithm is able to detect completely irrelevant features and to remove some additional features without significantly hurting the performance of the clustering algorithm.

关键词： feature selection unsupervised learning k-nearest neighbor

作者: Yihui Luo Shuchu Xiong

作者单位: Department of Information Hunan University of Commerce Changsha, China

会议类型: 国际会议

会议名称: 2010 International Conference on Intelligent Computation Technology and Automation(2010 智能计算技术与自动化国际会议 ICICTA 2010)

会议地点: 长沙

会议语种:英文

页码: 1479-1482

在线出版日期: 2010-05-11（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Feature Selection through Optimization of k-Nearest Neighbor Matching Gain