会议专题

COMPARATIVE RESEARCH ON SHORT TEXT CATEGORIZATION

Resent year,various kinds of short text data has been continually emerging in large numbers,such as article summary,e-mail,instant messages online and so on. Although the text categorization techniques in a number of areas have been gotten very good research and applications, for relatively short and different structures of the short data, the categorization research is relatively less.In fact, in order to use handily,it is very necessary to do the categorization research and application.In this paper,based on existing research,we analyzed and research some current text categorization algorithm, according to the experimental data,we also analyzed and compared that,in short text data categorization,SVM algorithm has good performance, and we suggested some improvements just for reference.

Short Tezt Categorization K-Nearest Neighbor (K-NN) Naive Bayes (VB) Support Vector Machine (SVM)

Juan Chang Xueqin Lu

Software college,Ningbo Dahongying University,Ningbo 315175,China Ningbo Dahongying University,Ningbo 315175,China

国际会议

2009 International Symposium on Computer Science and Technology(2009 中国宁波国际计算机科学与技术学术大会)

宁波

英文

335-338

2009-11-20(万方平台首次上网日期,不代表论文的发表时间)