会议专题

Chinese Text Classification Based On LDA and KSVM

  With the rapid development of information technologyand social networking, the amount of generated text data has increased enormously. As one of the crucial technologies for information organization and management, text classification has become much more significant in the area of machine learning and natural language processing. According to this paper, we present a text classification system. First, we apply LDA topic model to express the text instead of Boolean model or vector space model. Then, we choose KSVM which combines SVM with KNN as the classification algorithm. Finally, we choose documents with large amount of Chinese news for experiments. Compared with normal language models, these experimental data shows that our system gets higher classification accuracy.

Machine Learning Text Classification LDA KSVM KNN SVM-KNN

Congwei Liang Yong Liu Haiqing Du

Beijing University of Posts and Telecommunications, Beijing, China

国际会议

2015 Joint International Mechanical,Electronic and Information Technology Conference(JIMET 2015)(2015 联合国际机械,电子与信息技术国际会议)

重庆

英文

379-383

2015-12-18(万方平台首次上网日期,不代表论文的发表时间)