Conference topic

Research on Domain Text Classification Algorithm Based on Semantics

Currently, most semantics-based text categorization work remains theoretical, and very few studies target specific domains. Further research and development is therefore needed for text classification in particular domains. We use classification-related techniques to address the lack of semantics, combining a machine learning algorithm with a concept vector model. This paper introduces a concept representation of text that uses domain ontology knowledge to obtain the relationships between words in the text and ultimately forms a concept vector space model. A simple vector distance classifier is then applied to this model to realize domain text categorization.
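The pipeline outlined in the abstract (map words to concepts, build concept vectors, classify by vector distance) might be sketched as follows. This is only an illustration under stated assumptions, not the paper's implementation: a small hand-written dictionary stands in for the domain ontology and WordNet, cosine similarity is assumed as the vector distance, and all names (`TOY_ONTOLOGY`, `concept_vector`, `train_centroids`, `classify`) are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy ontology (assumption): maps surface words to domain concepts.
# A real system would derive these links from a domain ontology or WordNet.
TOY_ONTOLOGY = {
    "car": "vehicle", "automobile": "vehicle", "truck": "vehicle",
    "engine": "vehicle_part", "brake": "vehicle_part",
    "doctor": "medical_staff", "physician": "medical_staff",
    "hospital": "medical_facility", "clinic": "medical_facility",
}


def concept_vector(text):
    """Count concepts instead of words; words missing from the ontology are dropped."""
    tokens = text.lower().split()
    return Counter(TOY_ONTOLOGY[t] for t in tokens if t in TOY_ONTOLOGY)


def cosine(a, b):
    """Cosine similarity between two sparse concept vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_centroids(labeled_docs):
    """Sum the concept vectors of each class.

    Cosine similarity ignores scale, so the sum behaves like the mean here.
    """
    centroids = {}
    for text, label in labeled_docs:
        centroids.setdefault(label, Counter()).update(concept_vector(text))
    return centroids


def classify(text, centroids):
    """Assign the class whose centroid is most similar to the document vector."""
    vec = concept_vector(text)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


training = [
    ("the doctor works at the hospital", "medical"),
    ("the truck has a new engine", "automotive"),
]
centroids = train_centroids(training)
label = classify("a physician visited the clinic", centroids)
```

In this sketch the test sentence shares no words with the training sentences, yet it matches the medical class because "physician" and "clinic" map to the same concepts as "doctor" and "hospital". That is the kind of gap a word-level vector model would miss.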

Domain ontology; WordNet; Text classification

Yan Jianzhuo, Zhang Guixi, Fang Hying

Beijing University of Technology, Electronic Information and Control Engineering, Beijing 100124, China

International conference

The 13th IEEE Joint International Computer Science and Information Technology Conference (JICSIT 2011)

Chongqing

English

1870-1873

2011-08-20 (date first made available online on the Wanfang platform; not the paper's publication date)