Margin Based Sample Weighting for Stable Feature Selection
Stability of feature selection is an important issue in knowledge discovery from high-dimensional data. A key factor affecting the stability of a feature selection algorithm is the sample size of training set. To alleviate the problem of small sample size in high-dimensional data, we propose a novel framework of margin based sample weighting which extensively explores the available samples. Specifically, it exploits the discrepancy among local profiles of feature importance at various samples and weights a sample according to tim outlying degree of its lo cal profile of feature importance. We also develop an efficient algorithm under the framework. Experiments on a set of public microarray datasets demonstrate that the proposed algorithm is effective at improving the stability of state-of-the-art feature selection algorithms, while maintain ing comparable classification accuracy on selected features.
Yue Han Lei Yu
State University of New York at Binghamton Binghamton, NY 13902, USA
国际会议
11th International Conference,WAIM 2010(第十一届网络时代管理国际会议)
九寨沟
英文
680-691
2010-07-14(万方平台首次上网日期,不代表论文的发表时间)