会议专题

Hypothesis-margin model incorporating structure information for feature selection

Iterative search margin based algorithm (Simba) has been proven effective for feature selection. However, the previously proposed model does not effectively utilize the structure information hidden in data which may have a great impact on the generalization performance of post-analysis classifiers. In this paper, we introduce a novel hypothesis-margin model incorporating structure information for feature seection(Ssimba_FS). In the newly developed model, the structure information induced by clustering algorithms is incorporated into the existing hypothesis margin model for feature selection, and meanwhile the contribution of the structure information can be effectively adjusted by a trade-off parameter. Based on Ssintba_FS, we present a novel algorithm for feature selection(Ssimba). By Ssimba, an effectively ranked feature list can be obtained, futher a compact and relevant feature subset can be directly generated from the ranked feature list. The experiments on 6 real-life benchmark datasets show that the classifiers induced by the algorithm of this paper has better or comparable classification performance than those established by Simba in most cases.

hypothesis-margin feature selection structure information

Ming Yang Ping Yang

School of Mathematics and Computer Science Nanjing Normal University Nanjing,210097,P.R.China

国际会议

Second International Symposium on Electronic Commerce and Security(第二届电子商务与安全国际研究大会)(ISECS 2009)

南昌

英文

634-639

2009-05-22(万方平台首次上网日期,不代表论文的发表时间)