会议专题

Automatic Classification of Tibetan Web Pages

A classification approach for Tibetan web pages is introduced in this paper. It takes advantage of the class feature dictionary and Rocchio classification algorithm to classify the Tibetan web pages into the predefined classes rapidly and accurately. The experimental results present that the approach has better classification accuracy for Tibetan web pages classification. It is useful and helpful for the construction of the statistical and rule-based classification of Tibetan texts as well as construction of high-quality Tibetan corpus.

Tibetan Information Processing Text classification Classification of Web Pages

Guixian Xu Chuncheng Xiang Xu Gao Xiaobing Zhao Guosheng Yang

College of Information Engineering,, Minzu University of China, Beijing, China 100081 Minority Langu College of Information Engineering, Minzu University of China, Beijing, China 100081 North China Grid Company Limited, Beijing, China 100053 College of Information Engineering, Minzu University of China, Beijing, China 100081 Minority Langua College of Information Engineering,Minzu University of China, Beijing, China 100081

国际会议

2012 International Conference on Computer Science and Electronic Engineering(2012 IEEE计算机科学与电子工程国际会议 ICCSEE 2012)

杭州

英文

423-426

2012-03-23(万方平台首次上网日期,不代表论文的发表时间)