会议专题

Web Page Classification based on Semi-supervised Naive Bayesian EM Algorithm

Web Classification is one of the hot researches in Web mining field. Within the exploded Internet information circumstance, most pages are unlabeled. This paper has proposed a Naive Bayesian EM algorithm classification method based on the feature of Semisupervised machine learning. The method used Hierarchical Clustering EM framework to train Naive Bayesian Classifier iteratively. The result of the experiment proved that the method introduced in the paper shows good effect of Web classification.

Web page classification web mining Semi supervised Hierarchical Clustering

Wang Zhixing Chen Shaohong

Computer Center East China Normal University Shanghai,China

国际会议

2011 2nd International Conference on Data Storage and Data Engineering(DSDE 2011)(2011年第二届数据存储与数据工程国际会议)

西安

英文

242-245

2011-05-13(万方平台首次上网日期,不代表论文的发表时间)