Research on domain Text Classification Algorithm based on semantics

摘要：

currently, most text categorization based on semantics theory only stay in the study, with very few specific areas to study. Therefore, the need for further research and development needed for certain areas of text classification.We use the classification of related technologies to solve the problem of the lack of semantics, we will combine machine learning algorithm and the concept vector model,this paper will introduce a concept representation of text which makes use of domain ontology knowledge to obtain relationships between words in the text, and eventually form a concept vector space model as well as a simple vector distance classification will apply it to realize domain text categorization.

关键词： Domain ontology WordNet Text classification

作者: Yan Jianzhuo Zhang guixi Fang Hying

作者单位: Beijing University of Technology Electronic Information and Control Engineering Beijing 100124, China

会议类型: 国际会议

会议名称: The 13th IEEE Joint International Computer Science and Information Technology Conference(2011年第13届IEEE联合国际计算机科学与信息技术会议 JICSIT 2011)

会议地点: 重庆

会议语种:英文

页码: 1870-1873

在线出版日期: 2011-08-20（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Research on domain Text Classification Algorithm based on semantics