Constructing a PU Text Classifier with Incremental Characteristic
Based on Focused Crawling,the paper designs and implements a PU text classification model with some incremental characteristic.For the case of negative set in the training samples which are not clear-cut,it first obtains a credible negative set by improving 1-DNF algorithm and then iterate trains the classifier,lastly obtains the final classifier for the theme of crawling text classification.The model learns some of the positive set and negative set in each training loop,then enters into the next training.It acquires a good self-adaptability,and reaches a good precision in the context of declining training samples.
PU Classification 1-DNF Focused Crawling, Incremental Characteristic
ShiYong Ling JinHong Gong
Center of Modern Education and Technology East China Jiaotong University, Nanchang,China School of Electrical Engineer, East China Jiaotong University, Nanchang,China
国际会议
西安
英文
1318-1323
2012-08-24(万方平台首次上网日期,不代表论文的发表时间)