Individualized Automatic Classification of Web Documents
This paper applies Na(i)ve Bayes classifier in designing customized automatic web document classification to systematically collecting massive news articles from the Internet. The proposed news classification system allows users to establish the necessary information classifications based on their own preferences. When the amount of daily news is increasing, this approach enables users to effectively filter through large amount of articles and more focused on interested articles. Performances of the proposed approach are characterized by the recall rate and precision. This system can achieve over 66% recall rate, and over 89% precision rate for a real-world Chinese test database.
Na(i)ve Bayes web documents classification
Yihjia Tsai Kaun-Yu Chen
Department of Computer Science and Information Engineering, Tamkang University, Taipei
国际会议
2010 Cross-Strait Conference on Information Science and Technology(2010 海峡两岸信息科学与技术学术交流会)
秦皇岛
英文
410-412
2010-07-09(万方平台首次上网日期,不代表论文的发表时间)