User Oriented Semi-supervised Document Clustering
In many text mining applications, it is needed to cluster documents according to demand of users. However, Traditional documents clustering that use unsupervised learning are not able to meet this demand. In this paper, a new clustering approach that focuses on the problem is proposed. Main contributions include: (1) Expresses user requirement by topic with multiple attributes (2) Annotates topic semantic by ontology, calculate dissimilarity between topic semantics and build dissimilarity matrix. Experiments show that new approach is effective.
Document clustering Ontology Topic Annotation HowNet Hierarchy Clustering
Rongfu Zhou Lihua Wang Dingxin shuai Ka Hei
The School of Information and Electricity Engineering PanZhiHua university,SiCuan,PanZhiHua 617000 C Pansteel Cold Rolling Mill,Panzhihua ,Sichuan 617000, China
国内会议
厦门
英文
1-6
2012-08-01(万方平台首次上网日期,不代表论文的发表时间)