Improved Nonnegative Matrix Factorization Based Feature Selection for High Dimensional Data Analysis
Feature selection has become the focus of research areas of applications with high dimensional data.Nonnegative matrix factorization (NMF) is a good method for dimensionality reduction but it cant select the optimal feature subset for its a feature extraction method.In this paper,a two-step strategy method based on improved NMF is proposed.The first step is to get the basis of each catagory in the dataset by NMF.Added constrains can guarantee these basises are sparse and mostly distinguish from each other which can contribute to classfication.An auxiliary function is used to prove the algorithm convergent.The classic ReliefF algorithm is used to weight each feature by all the basis vectors and choose the optimal feature subset in the second step.The experimental results revealed that the proposed method can select a representive and relevant feature subset which is effective in improving the performance of the classifier.
feature selection nonnegative matrix factorization reliefF algorithm
Lincheng Jiang Wentang Tan Zhenwen Wang Fengjing Yin Bin Ge Wendong Xiao
Science and Technology on Information Systems Engineering Laboratory National University of Denfense Technology Chang Sha,China
国际会议
杭州
英文
2329-2332
2013-03-22(万方平台首次上网日期,不代表论文的发表时间)