Integrating Background Knowledge into RBF Networks for Text Classification

摘要：

Text classification is a problem applied to natural language texts that assigns a document into one or more predefined categories,based on its content.In this paper,we present an automatic text classification model that is based on the Radial Basis Function (RBF) networks.It utilizes valuable discriminative information in training data and incorporates background knowledge in model learning.This approach can be particularly advantageous for applications where labeled training data are in short supply.The proposed model has been applied for classifying spam email,and the experiments on some benchmark spam testing corpus have shown that the model is effective in learning to classify documents based on content and represents a competitive alternative to the well-known text classifiers such as naive Bayes and SVM.

关键词： Radial basis function networks text classification clustering information retrieval

作者: Eric P.Jiang

作者单位: University of San Diego,5998 Alcala Park San Diego,California 92110,United States of America

会议类型: 国际会议

会议名称: 4th Asia Information Retrieval Symposium(AIRS 2008)(第四届亚洲信息检索研讨会)

会议地点: 哈尔滨

会议语种:英文

页码: 61-70

在线出版日期: 2008-01-16（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Integrating Background Knowledge into RBF Networks for Text Classification