Conference Topic

The Research of an Improved Information Gain Method Using Distribution Information of Terms

A shortcoming of information gain is that it takes into account the case in which a term does not appear. In this paper, by analyzing the distribution information of terms, we find that as a term's Distribution Information inside a Class becomes larger, the distribution of the term tends toward imbalance, and that when a term is highly imbalanced, its Distribution Information among Classes tends toward a smaller value. Therefore, the Distribution Information inside a Class and the Distribution Information among Classes are introduced to reduce the effect of term absence and to improve traditional information gain. Experiments show that, in some fields, the improved algorithm (GDI) performs better than traditional feature selection algorithms such as Information Gain.
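
For context, the traditional information gain that the abstract refers to can be sketched as follows. This is a minimal illustration of the standard textbook formula, not the paper's GDI method, whose formulas are not given in this record. The names `class_entropy` and `information_gain`, the document format, and the toy data are assumptions made for illustration. The last subtracted term is the contribution of documents in which the term does not appear, which is the part the abstract identifies as a weakness.

```python
import math
from collections import Counter


def class_entropy(labels):
    """Shannon entropy (in bits) of the class labels in `labels`."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(docs, term):
    """Standard information gain of `term` over `docs`.

    `docs` is a list of (set_of_terms, class_label) pairs; this input
    format is a hypothetical choice for the sketch.
    """
    all_labels = [label for _, label in docs]
    present = [label for terms, label in docs if term in terms]
    absent = [label for terms, label in docs if term not in terms]
    p_present = len(present) / len(docs)
    p_absent = len(absent) / len(docs)
    return (class_entropy(all_labels)
            - p_present * class_entropy(present)   # term appears
            - p_absent * class_entropy(absent))    # term does not appear


# Toy corpus (illustrative only).
docs = [
    ({"goal", "match"}, "sports"),
    ({"goal", "team"}, "sports"),
    ({"stock", "market"}, "finance"),
    ({"stock", "bank"}, "finance"),
]
print(information_gain(docs, "goal"))  # term that separates the classes perfectly
print(information_gain(docs, "team"))  # rare term, score driven mostly by absence
```

In this toy corpus, "team" occurs in only one document, so most of its score comes from the documents that lack it. That is the kind of effect the abstract says the two distribution measures are meant to reduce.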

YANG Yu-zhen, LIU Pei-yu, ZHU Zhen-fang, QIU Ye

Department of Information Science and Engineering, Shandong Normal University, Jinan 250014, China

International conference

2009 IEEE International Symposium on IT in Medicine & Education

Jinan

English

938-941

2009-08-14 (date first posted online on the Wanfang platform; not the paper's publication date)