A Hierarchical Clustering for Categorical Data Based on Holo-entropy
High dimensional data clustering is a difficult task in clustering analysis.Subspace clustering is an effective approach.The principle of subspace clustering is to maximize the retention of the original data information while searching for the minimal size of subspace for cluster representation.Based on information entropy and Holo-entropy, we propose an adaptive high dimensional weighted subspace clustering algorithm.The algorithm employs information entropy to extract the feature subspace, uses class compactness which binding Holo-entropy with weight in subspace for sub-clusters merging instead of the traditional similarity measurement method, and it selects the most compacted two sub-clusters to merge to achieve the maximum degree clustering effect.The algorithm is tested on nine UCi dataset, and compared with other algorithms.Our algorithm is better in both efficiency and accuracy than the other existing algorithms and has high reproducibility.
Hierarchical Clustering Holo-entropy Subspace Categorical Data
Haojun Sun Rongbo Chen Shulin Jin Yong Qin
Department of Computer Science Shantou University Shantou, China State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing
国际会议
济南
英文
269-274
2015-09-11(万方平台首次上网日期,不代表论文的发表时间)