Study of High Dimensional Clustering Algorithm Based on Graph Partition
In many clustering applications, the data sets are high-dimensional, sparse and binary, resulting to the failure of traditional algorithms in handling these data. In this paper, we present a new clustering algorithm based on graph partition for high-dimensional data, which, by defining the feature vector of attribute-value distribution and the similarity of attribution-value distribution, and creating a sequence of smaller and smaller coarse graphs from the original base graph. The smallest coarse graph is then partitioned using a spectral method, and this partition is propagated back through the hierarchy of graphs. Thus, the corresponding data items in each partition are highly related. The analysis demonstrates that this algorithm is effective in clustering knowledge discover.
data mining high-dimensional clustering graph partition
GAO Yuan
School of Electronic and Computer Science and Technology North University of China Taiyuan, China
国际会议
太原
英文
534-537
2010-10-22(万方平台首次上网日期,不代表论文的发表时间)