Application of Improved Random Forest Variables Importance Measure to Traditional Chinese Chronic Gastritis Diagnosis
Many machine learning approaches have been proposed to establish the chronic gastritis diagnostic models. But till now, most of the machine-learning classifiers do not give any insight as to which features play key roles with respect to the derived classifier as well as the individual class. Recently, the variables importance measure yielded by random forest (RF) has been proposed in many applications. However, in multi-label classifications RF attempts to yield a common feature ranking for all classes, which fail in identifying the distinct predictive structures for individual class. This paper developed an improved random forest variables importance measure to evaluate the importance offeatures according to each individual class in multi-classification problem, and then applied a wrapper method for feature selection to construct the key features sets referring to each subtype of the chronic gastritis. Experiment results show that, compared with the previous studies, the selected features are more close to expert knowledge and contribute to better understanding of the underlying process that characterize the chronic gastritis.
Huazhen Wang Chengde Lin Yanqing Peng Xueqin Hu
School Of Information Science and Technology,Xiamen University,361005,P.R.China College of Basic Medical Science,Shanghai University of Chinese Medicine,201203,P.R.China
国际会议
厦门
英文
84-89
2008-12-12(万方平台首次上网日期,不代表论文的发表时间)