Research on Domain Text Classification Algorithm Based on Semantics
At present, most semantics-based text categorization remains at the level of theoretical study, and very little of it targets specific domains. Further research and development is therefore needed on text classification for particular domains. We use classification techniques to address the lack of semantic information, combining a machine learning algorithm with a concept vector model. This paper introduces a concept representation of text that uses domain ontology knowledge to obtain the relationships between words in a text and ultimately forms a concept vector space model. A simple vector distance classifier is then applied to this model to perform domain text categorization.
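The pipeline the abstract describes (map words to ontology concepts, build a concept vector, classify by vector distance) can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: `TOY_ONTOLOGY` is a hypothetical hand-written word-to-concept map standing in for the domain ontology and WordNet lookups, the function names are invented for this example, and cosine similarity to per-class centroids is assumed as the "simple vector distance" measure.

```python
import math
from collections import Counter

# Hypothetical word-to-concept map standing in for a domain ontology lookup
# (the paper's actual ontology and WordNet relations are not reproduced here).
TOY_ONTOLOGY = {
    "car": "vehicle", "truck": "vehicle", "bus": "vehicle",
    "apple": "fruit", "banana": "fruit", "pear": "fruit",
}

def concept_vector(text):
    """Map each known word to its concept and count concept frequencies."""
    words = text.lower().split()
    return Counter(TOY_ONTOLOGY[w] for w in words if w in TOY_ONTOLOGY)

def cosine(u, v):
    """Cosine similarity between two sparse concept vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_u * norm_v)

def train_centroids(labeled_texts):
    """Sum the concept vectors of each class into one centroid per class."""
    centroids = {}
    for text, label in labeled_texts:
        centroids.setdefault(label, Counter()).update(concept_vector(text))
    return centroids

def classify(text, centroids):
    """Assign the label whose centroid is most similar to the text's vector."""
    vec = concept_vector(text)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))
```

Because "bus" and "truck" both map to the same concept, a document can match a class even when it shares no surface words with the training texts, which is the gap in purely word-based vectors that the abstract aims to close.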
Domain ontology; WordNet; Text classification
Yan Jianzhuo, Zhang Guixi, Fang Hying
Beijing University of Technology, Electronic Information and Control Engineering, Beijing 100124, China
International conference
Chongqing
English
1870-1873
2011-08-20 (date first made available online on the Wanfang platform; not the paper's publication date)