Improved SMOTEBagging and Its Application in Imbalanced Data Classification
Many real world data mining applications involve imbalanced data sets, When all kinds of data are unevenly distributed and the particular evens of interest may be very few when compared to the other class. Data sets that contain rare evens usually produces biased classifiers that have a higher predictive accuracy over the majority class, but poorer predictive accuracy over the minority class of interest. This paper presents a novel ensemble algorithm with improved SMOTE, and combines selective ensemble with Bagging, which balances the class distribution with improved SMOTEBagging algorithm. Experiments on four UCI data sets and protein-protein interaction experiments mentioned above prove the performance of the method.
component:SMOTE Bagging SVM Imbalanced Datasets
Zhang Yongqing Zhu Min Zhang Danling Mi Gang Ma Daichuan
School of Computer Science Sichuan University Chengdu, Sichuan Province, China School of Life Science, Sichuan University Chengdu, Sichuan Province, China School of Chemistry Scie
国际会议
重庆
英文
1027-1031
2011-08-20(万方平台首次上网日期,不代表论文的发表时间)