会议专题

Research on Optimization of Clustering by Fast Search and Find of Density Peaks

  CFSFDP is a clustering algorithm based on density peaks, which can cluster non-spherical data sets, and also has the advantages of fast clustering and simple realization.However, the global density threshold dc, which leads to the decrease of clustering quality, is specified without the consideration of spatial distribution of the data.Moreover, the data sets with multi-density peaks cannot be clustered accurately.To resolve the above shortcomings, an optimization of CFSFDP algorithm based on grid (GbCFSFDP) is proposed.To avoid the using of global de, first, the algorithm divides the data sets into smaller partitions by using the method of grid partitioning and performs local clustering on them.Then, GbCFSFDP merges the subclasses.Finally, data sets which are unevenly distributed and exist multi-density peaks are correctly classified.Simulation experiments of two typical data sets show that the proposed GbCFSFDP algorithm is more accurate than CFSFDP.

clustering density threshold grid partition mergig clusters

Jinlong Zheng Hao Sun Mingxin Zhang Guohai Zhang

School of Computer Science and Engineering Changshu Institute of Technology Changshu, China School of Computer Science and Technology Soochow University Suzhou, China School of Agricultural Engineering and Food Science Shandong University of Technology Zibo, China

国际会议

The 13th Web Information Systems and Applications Conference(第十三届全国web信息系统及其应用学术会议)(WISA2016)、The 1st Symposium on Big Data Processing and Analysis)( BDPA 2016)第一届全国大数据处理与分析学术研讨会、The 1st Workshop on Information System Security)(ISS2016)(第一届全国信息系统安全研讨会

武汉

英文

129-133

2016-09-23(万方平台首次上网日期,不代表论文的发表时间)