会议专题

A Novel Algorithm DBCAPSIC for Clustering Non-Numeric Data

  Data mining techniques are playing an important role in the analysis of mass network information and big data nowadays.The cluster analysis,as a main kind of method in data mining,draws great interest from researchers of various fields who proposed many algorithms such as k-means algorithm and its variants,density-based algorithm and its variants.However,these algorithms all have their own problems.This paper focuses on some of the problems and proposes a novel algorithm DBCAPSIC.The algorithm overcomes the k-means algorithms sensitivity to initial conditions and avoids common density-based algorithms clustering failure in some cases.Also,the algorithm has the linear time complexity of O(n),compared to the quadratic time complexity of common density-based clustering algorithms.

k-means density DBCAPSIC sensitivity clustering failure

Jinkun Geng Daren Ye Ping Luo

School of Software,Beihang University,Beijing 100191,China Key Laboratory for Information System Security, Ministry of Education;Tsinghua National Laboratory f

国际会议

2015 IEEE Advanced Information Technology, Electronic and Automation Control Conference(IAEAC 2015)(2015 IEEE先进信息技术,电子与自动化控制国际会议)

重庆

英文

295-304

2015-12-19(万方平台首次上网日期,不代表论文的发表时间)