Automatic Classification of Tibetan Web Pages
A classification approach for Tibetan web pages is introduced in this paper. It takes advantage of the class feature dictionary and Rocchio classification algorithm to classify the Tibetan web pages into the predefined classes rapidly and accurately. The experimental results present that the approach has better classification accuracy for Tibetan web pages classification. It is useful and helpful for the construction of the statistical and rule-based classification of Tibetan texts as well as construction of high-quality Tibetan corpus.
Tibetan Information Processing Text classification Classification of Web Pages
Guixian Xu Chuncheng Xiang Xu Gao Xiaobing Zhao Guosheng Yang
College of Information Engineering,, Minzu University of China, Beijing, China 100081 Minority Langu College of Information Engineering, Minzu University of China, Beijing, China 100081 North China Grid Company Limited, Beijing, China 100053 College of Information Engineering, Minzu University of China, Beijing, China 100081 Minority Langua College of Information Engineering,Minzu University of China, Beijing, China 100081
国际会议
杭州
英文
423-426
2012-03-23(万方平台首次上网日期,不代表论文的发表时间)