Variable Selection based on Maximum Information Coefficient for Data Modeling
Whether the variable selection is accurate or not affect the accuracy and generalization ability of the model.The traditional variable selection method is difficult to maintain a high stability under high collinearity.In order to solve the problem,we propose a new method MICFS(Feature Select based on Maximal Information Coefficient),which combines the maximum information coefficient with the existing mutual information variable selection method.Firstly,this paper introduces the theory of mutual information and the variable selection algorithm based on mutual information,and then use the maximum information coefficient instead of the original mutual information criterion.Finally,the validity of method is verified by using the Friedman data set.The result shows that this method can meet the requirements of variable selection in a high collinearity and high noise environment.
mutual information variable selection maximum information coefficient
Fuchang Chu Zhenping Fan Baohui Guo Dan Zhi Zijian Yin Wenjie Zhao
College of Automation,North China Electric Power University Baoding
国际会议
重庆
英文
1714-1717
2017-03-25(万方平台首次上网日期,不代表论文的发表时间)