会议专题

Study of Tibetan Text Categorization Based on Ensemble Learning Classifier

  Based on the text feature and syntatic structure of Tibetan,this theory mainly focuses on categorization of Tibetan through ensemble learning classified method.Combine KNN and Naive Bayesian to build a basic classifier on account of three character subsets over category of word character.And then take weighted calculation of basic classifier.For the last step,calculate the weight of basic classifier via gradient descend and reach final result of categorization.During experiment,choose recall ratio,precision ratio as well as other evaluation function to analyze and evaluate KNN,Naive Bayesian and text categorization.Conclusion from experiment shows that precision of calculation of Tibetan text based on ensemble learning method has been greatly improved.

Tibetan text Text categorization Ensemble learning Linear weighting Gradient descent

Li Ailin Li Ailin Yuan Bin

National Languages Information Technology Northwest University for Nationalities

国际会议

2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference(IAEAC 2015)(2015 IEEE先进信息技术,电子与自动化控制国际会议)

重庆

英文

69-72

2015-12-19(万方平台首次上网日期,不代表论文的发表时间)