Web Page Classification based on Semi-supervised Naive Bayesian EM Algorithm
Web Classification is one of the hot researches in Web mining field. Within the exploded Internet information circumstance, most pages are unlabeled. This paper has proposed a Naive Bayesian EM algorithm classification method based on the feature of Semisupervised machine learning. The method used Hierarchical Clustering EM framework to train Naive Bayesian Classifier iteratively. The result of the experiment proved that the method introduced in the paper shows good effect of Web classification.
Web page classification web mining Semi supervised Hierarchical Clustering
Wang Zhixing Chen Shaohong
Computer Center East China Normal University Shanghai,China
国际会议
西安
英文
242-245
2011-05-13(万方平台首次上网日期,不代表论文的发表时间)