A Simple Clustering Knowledge Presentation Method on High-dimension Binary Data Set
The presentation and explanation of the clustering result play an important role in the technology of clustering . Based on the rough set theory on attribute space, a new clustering result presentation method is advanced. Firstly, the different properties of high-dimension binary data on object space and attribute space have been studied; secondly, the concepts of Low Approximation, Upper Approximation and Feature Precision have been defined for data set on attribute space and the Clustering Information Factor has been defined; and thirdly, a method of Clustering Knowledge Representation on Object Space and attributes space has been proposed. It is simple enough to be understood easily, can provide three kinds of information, that is the distribution of objects, the relationship of the objects distribution and the attributes set, and the rules how to assign new objects to clusters. It can provide relatively synthesis information of clustering result on object space and attribute space, reflect the clustering knowledge with rules, enable users to capture more useful pattern and to hold the internal structure of high-dimension binary data sets.
Clustering High-dimension Binary Data Clustering Result Presentation Rough set Theory
CHEN Jianbin Sun Jie Gao Shuli
Department of E-business, Business College of Beijing Union University,100025, Beijing, China
国际会议
北京
英文
2007-08-05(万方平台首次上网日期,不代表论文的发表时间)