会议专题

WEIGHTED CLUSTERING ALGORITHM FOR THE DATA WITH MIXED ATTRIBUTES

  Based upon W-k-means framework, a new clustering algorithm is proposed.In the algorithm, the concept of distribution centroid is introduced to represent the centre of cluster with categorical attributes, then distribution centroid and mean is combined to represent the centre of cluster of mixed numeric and categorical attribute data.A new dissimilarity measure, which takes into account the influence of different attributes in clustering process, is used to calculate the distance between data objects and the centre of cluster.In addition,the weight strategy of the W-k-means framework is used to assess the influence of different attributes.The proposed algorithm is verified via a series of experiments on real world datasets.Better clustering accuracy is demonstrated in comparison with those of traditionalclustering algorithms.

Cluster analysis Numericattribute data Categorical attribute data W-k-means algorithm

Yiyang Wang Li Wang Zhong Qian Bo Xu Chao Lei Yao Zhong Yue You

School of Economics and Management, Beihang University, Beijing 100191, China

国际会议

The 12th International Conference on Industrial Management(第十二届工业管理国际会议)

成都

英文

333-338

2014-09-03(万方平台首次上网日期,不代表论文的发表时间)