会议专题

Study of High Dimensional Clustering Algorithm Based on Graph Partition

In many clustering applications, the data sets are high-dimensional, sparse and binary, resulting to the failure of traditional algorithms in handling these data. In this paper, we present a new clustering algorithm based on graph partition for high-dimensional data, which, by defining the feature vector of attribute-value distribution and the similarity of attribution-value distribution, and creating a sequence of smaller and smaller coarse graphs from the original base graph. The smallest coarse graph is then partitioned using a spectral method, and this partition is propagated back through the hierarchy of graphs. Thus, the corresponding data items in each partition are highly related. The analysis demonstrates that this algorithm is effective in clustering knowledge discover.

data mining high-dimensional clustering graph partition

GAO Yuan

School of Electronic and Computer Science and Technology North University of China Taiyuan, China

国际会议

The 2010 International Conference on Computer Application and System Modeling(2010计算机应用与系统建模国际会议 ICCASM 2010)

太原

英文

534-537

2010-10-22(万方平台首次上网日期,不代表论文的发表时间)