会议专题

Mixture Models for Web Page Classification

This paper presents a method for designing semisupervised classifier trained on labeled and unlabeled instances. We explore the trade-off maximizing a generative likelihood of labeled and unlabeled data. Moreover, mixture models are an interesting and flexible model family. The different uses of mixture models include for example generative models and density estimation. This paper investigates semi-supervised learning of mixture models using a unified objective function taking both labeled and unlabeled data into account. We conducted experiments on the WebKB and 20NEWSGROUPS. The results show that unlabeled data results in improvement in classification accuracy over the supervised model.

semi-supervised learning mixture model generative model

Bai JingHua Zhang XiaoXian Li ZhiXin Li XiaoPing

School of Computer Science and technology Beijing Institute of Technology Beijing, P.R. China Department of software Changchun Institute of Technology ChangChun, P.R. China

国际会议

2010 Second Asia-Pacific Conference on Information Processing(2010年第二届亚太地区信息处理国际会议 APCIP 2010)

南昌

英文

80-83

2010-09-17(万方平台首次上网日期,不代表论文的发表时间)