会议专题

AdaUK-Means: An Ensemble Boosting Clustering Algorithm on Uncertain Objects

  This paper considers the problem of clustering uncertain objects whose locations are uncertain and described by probability density functions(pdf).Though K-means has been extended to UK-means for handling uncertain data,most existing works only focus on improving the efficiency of UK-means.However,the clustering quality of UKmeans is rarely considered in existing works.The weights of objects are assumed same in existing works.However,the weights of objects which are far from their cluster representatives should not be the same as the weights of objects which are close to their cluster representatives.Thus,we propose an AdaUK-means to group the uncertain objects by considering the weights of objects in this article.In AdaUK-means,the weights of objects will be adjusted based on the correlation between objects by using Adaboost.If the object pairs are must-link but grouped into different clusters,the weights of the objects will be increased.In our ensemble model,AdaUK-means is run several times,then the objects are assigned by a voting process.Finally,we demonstrate that AdaUK-means performs better than UK-means on both synthetic and real data sets by extensive experiments.

Uncertain data UK-means Weighted objects Ensemble clustering Adaboost

Lei Xu Qinghua Hu Xisheng Zhang Yanshuo Chen Changrui Liao

School of Electronic and Communication Engineering,Shenzhen Polytechnic,Shenzhen,China School of Computer Science and Technology,Tianjin University,Tianjin,China Department of Economics,University of Wisconsin Madison,Madison,USA Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province

国际会议

第七届全国模式识别学术会议(The 7th Chinese Conference on Pattern Recognition,CCPR2016)

成都

英文

27-41

2016-11-03(万方平台首次上网日期,不代表论文的发表时间)