会议专题

Individualized Automatic Classification of Web Documents

This paper applies Na(i)ve Bayes classifier in designing customized automatic web document classification to systematically collecting massive news articles from the Internet. The proposed news classification system allows users to establish the necessary information classifications based on their own preferences. When the amount of daily news is increasing, this approach enables users to effectively filter through large amount of articles and more focused on interested articles. Performances of the proposed approach are characterized by the recall rate and precision. This system can achieve over 66% recall rate, and over 89% precision rate for a real-world Chinese test database.

Na(i)ve Bayes web documents classification

Yihjia Tsai Kaun-Yu Chen

Department of Computer Science and Information Engineering, Tamkang University, Taipei

国际会议

2010 Cross-Strait Conference on Information Science and Technology(2010 海峡两岸信息科学与技术学术交流会)

秦皇岛

英文

410-412

2010-07-09(万方平台首次上网日期,不代表论文的发表时间)