Optimizing the Classification Accuracy of Imbalanced Dataset Based on SVM
A dataset can be called imbalanced if at least one class of the data is represented by significantly less number of samples than the others. Imbalanced data generally exists in the real world. The classification performance of traditional machine learning algorithm is hampered in the classification tasks of imbalanced dataset. Support Vector Machines (SVM) is a new kind of machine learning method based on structural risk minimization principle and has had the best performance so far in several challenging applications. This paper summarizes the applications of SVM in imbalance dataset first and then presents some main improved methods which greatly improved the performance of classification in imbalanced dataset.
SVM imbalanced dataset machine learning
Zhang Sheng Shang Xiuyu Wang Wei Huang Xiuli
College of Mathematics, Physics and Information Engineering Zhejiang Normal University Jinhua China Department of Educational Technology Nanjing Normal University Nanjing China
国际会议
太原
英文
338-341
2010-10-22(万方平台首次上网日期,不代表论文的发表时间)