会议专题

A Text Classification Model Based on Training Sample Selection and Feature Weight Adjustement

A new text classification model based on training samples sdection and feature weight adjustment is presented. First it computes representativeness score of samplts so as to dishnguish noise samples from original training samples. Then a feature weight adjustment taking inter-class distribution and intra-class distribuhon into consideration is used to further improve the performance of text classification. The presented text classification model is applied on Chinese text dataset provided by Fudan Database Center. The experiments show that the proposed model can improve the performance of text classification to some extent with fewer training samples and fewer feature dimensions.

textclarssification representativeness score training dataset selection feature weight adjustment

Xuezeng Pang Yixing Liao

Department of Computer Science & Technology Zhejiang University Hangzhou, China Department of Computer Science & Technology Zhejiang University, Hangzhou,China Department of Inform

国际会议

The 2nd IEEE International Conference on Advanced Computer Control(第二届先进计算机控制国际会议 ICACC 2010)

沈阳

英文

294-297

2010-03-27(万方平台首次上网日期,不代表论文的发表时间)