会议专题

SEMI-SUPERVISED K-MEANS CLUSTERING FOR MULTI-TYPE RELATIONAL DATA

In many data mining tasks, there is a large supply of unlabeled data but limited labeled data since it is expensive generated. Therefore, a number of semi-supervised clustering algorithms have been proposed, but few of them are specially designed for multi-type relational data. In this paper, a semi-supervised A-means clustering algorithm for multi-type relational data is proposed, which is based on the combination of semi-supervised /(-means method and multi-type relational data clustering. In order to achieve high performance, in the algorithm, we first analyze all kinds of relationships in data, which include intra-relationship, inter-relationship, explicit and implicit relationship; and then extend /-means clustering algorithm by seeding and new similarity measures, where attributes information, labeled data and all kinds of relationships are employed. The experimental results show the effectiveness of our method.

Semi-supervised learning clustering algorithm multi- type relational data

YING GAO HONG QI DA-YOU LIU HE LIU

College of Computer Science and Technology, Jilin University, Changchun, 130012, China

国际会议

2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)

昆明

英文

326-330

2008-07-12(万方平台首次上网日期,不代表论文的发表时间)