A Text Classification Model Based on Training Sample Selection and Feature Weight Adjustement
A new text classification model based on training samples sdection and feature weight adjustment is presented. First it computes representativeness score of samplts so as to dishnguish noise samples from original training samples. Then a feature weight adjustment taking inter-class distribution and intra-class distribuhon into consideration is used to further improve the performance of text classification. The presented text classification model is applied on Chinese text dataset provided by Fudan Database Center. The experiments show that the proposed model can improve the performance of text classification to some extent with fewer training samples and fewer feature dimensions.
textclarssification representativeness score training dataset selection feature weight adjustment
Xuezeng Pang Yixing Liao
Department of Computer Science & Technology Zhejiang University Hangzhou, China Department of Computer Science & Technology Zhejiang University, Hangzhou,China Department of Inform
国际会议
The 2nd IEEE International Conference on Advanced Computer Control(第二届先进计算机控制国际会议 ICACC 2010)
沈阳
英文
294-297
2010-03-27(万方平台首次上网日期,不代表论文的发表时间)