An Improved Feature Selection Algorithm Based on Markov Blanket
For decades, coronary artery heart disease(CHD) has been one of the most threatening diseases to human health. Syndrome pattern mining is one of the attempts researchers have been done to conquer this disease. The main issue of syndrome pattern mining is to confirm the correspondence between syndrome and syndroms subset, so it can be done through feature selection techniques. Feature selection is a critical unit in classification, which is used to classify syndroms into different syndromes, and can effectively improve the speed and accuracy. In this paper, we propose a novel feature selection algorithm based on Markov blanket and information gain(MB-IGFS) for syndroms classification problem. In particular, we give a new and intuitive measurement of condition independence between features and class labels, which is more accurate and easy for calculation. For evaluation, experiments were conducted on Breast Cancer Wisconsin (Diagnostic) Data Set. Results suggest that, compared with other feature selection methods, MB-IGFS is effective and efficient in eliminating irrelevant and redundant features. Then we used MB-IGFS to give optimal syndroms subsets for both Solid ZHENG and Virtual ZHENG syndrome. We conclude that MB-IGFS appears a very attractive solution in syndroms classification applications.
feature selection Markov blanket informa- tion gain coronary artery heart disease traditional Chinese medicine syndrome pattern discovery syndrome recognition
Xiaohan Zuo Peng Lu Xi Liu Yibo Gao Yiping Yang Jianxin Chen
Dept. of Integrated Information System Research Center Institute of Automation Chinese Academy of Sc Beijing University of Chinese Medicine Beijing, 100029, P. R. China
国际会议
上海
英文
1657-1661
2011-10-15(万方平台首次上网日期,不代表论文的发表时间)