Web Text Categorization Based on Latent Semantic Analysis
Traditional text categorization methods are difficult to deal with the high dimensionality characteristic of the text document based on semantic concept of the words.This paper proposed a method to Web text categorization based on latent semantic analysis. Textual data was mapped into a lower space. The proposed approach used the singular-value decomposition to derive a latent semantic space.The SVM is used to text categorization in the semantic space.Experimental results show that the method is effective on the performance of the text categorization.
latent semantic analysis Web text categorization support vector machine
WANG Jianfeng YUAN Jinsha
Department of Electronic and Communication Engineering North China Electric Power University Baoding,Hebei province, 071003, China
国际会议
厦门
英文
826-828
2006-07-27(万方平台首次上网日期,不代表论文的发表时间)