A Hybrid Feature Selection Method for Data Sets of thousands of Variables
Feature selection has become the focus of research areas of applications with datasets of thousands of variables. In this study we present a hybrid feature selection (HFS) method that adopts both filter and wrapper models of feature subset selection. In the first stage of the feature selection, we use the filter model to rank the features by the mutual information (MI) between each feature and each class, and then choose k highest relevant features to the classes. In the second stage, we complete a wrapper model based feature selection algorithm, which uses Shepley value to evaluate the contribution of features to the classification task in a feature subset Experimental results show obviously that the HFS method obtains better classification performance than solo Shepley value based or solo MI based feature selection method.
feature selection Shepley value mutual information
Jihong Liu Guoxiong Wang
College of Information Science and Engineering Northeastern University Shenyang, China Political Department Liaoning Military Region Shenyang, China
国际会议
The 2nd IEEE International Conference on Advanced Computer Control(第二届先进计算机控制国际会议 ICACC 2010)
沈阳
英文
288-291
2010-03-27(万方平台首次上网日期,不代表论文的发表时间)