会议专题

Predicting Breast Cancer Survivability Using Random Forest and Multivariate Adaptive Regression Splines

In this paper, we propose a hybrid of random forest and multivariate adaptive regression splines algorithms for building a breast cancer survivability prediction model. We use random forest to perform a preliminary screening of variables and to receive a importance ranks. Then, the new dataset is extracted from initial WDBC dataset according to top-k important predictors and is input into the MARS procedure, which is responsible for building interpretable models for predicting breast cancer survivability. The capability of this combination method is evaluated using basic performance measurements (e.g., accuracy, sensitivity, and specificity) along with a 10-fold cross-validation. Experimental results show that the proposed method provides a higher accuracy and a relatively simple model.

Breast cancer random forest multivariate adaptive regression spline

Dengju Yao JingYang Xiaojuan Zhan

College of Computer Science and Technology, Harbin Engineering University, Harbin Heilongjiang, Chin College of Computer Science and Technology, Harbin Engineering University, Harbin Heilongjiang, Chin Department of Computer Science and Technology, Heilongjiang Institute of Technology, Harbin Heilongj

国际会议

2011 International Conference on Electronic & Mechanical Engineering and Information Technology(EMEIT 2011)(2011年机电工程与信息技术国际会议)

哈尔滨

英文

2204-2207

2011-08-12(万方平台首次上网日期,不代表论文的发表时间)