A Novel Reliable Negative Method Based on Clustering for Learning from Positive and Unlabeled Examples
This paper investigates a new approach for training text classifiers when only a small set of positive examples is available together with a large set of unlabeled examples.The key feature of this problem is that there are no negative examples for learning.Recently,a few techniques have been reported are based on building a classifier in two steps.In this paper,we introduce a novel method for the first step.which cluster the unlabeled and positive exampies to identify the reliable negative document,and then run SVM iteratively.We perform a comprehensive evaluation with other two methods,and show ex perimentally that it is efficient and effective.
Semi-Supervised Learning Text Classification Bisecting k-means Clustering Learning from Positive and Unlabeled Examples (LPU)
Bangzuo Zhang Wanli Zuo
College of Computer Science and Technology,Jilin University,ChangChun,130012,China;College of Comput College of Computer Science and Technology,Jilin University,ChangChun,130012,China
国际会议
4th Asia Information Retrieval Symposium(AIRS 2008)(第四届亚洲信息检索研讨会)
哈尔滨
英文
385-392
2008-01-16(万方平台首次上网日期,不代表论文的发表时间)