会议专题

Web Text Categorization Based on Latent Semantic Analysis

Traditional text categorization methods are difficult to deal with the high dimensionality characteristic of the text document based on semantic concept of the words.This paper proposed a method to Web text categorization based on latent semantic analysis. Textual data was mapped into a lower space. The proposed approach used the singular-value decomposition to derive a latent semantic space.The SVM is used to text categorization in the semantic space.Experimental results show that the method is effective on the performance of the text categorization.

latent semantic analysis Web text categorization support vector machine

WANG Jianfeng YUAN Jinsha

Department of Electronic and Communication Engineering North China Electric Power University Baoding,Hebei province, 071003, China

国际会议

第一届国际计算机新科技与教育学术会议(Proceedings of the First International Conference on Computer Science & Education ICCSE2006)

厦门

英文

826-828

2006-07-27(万方平台首次上网日期,不代表论文的发表时间)