会议专题

New Approach for Distributed Clustering

Nowadays the data collections are huge and in most cases do not reside in a centralised location. The latter complicates the task of traditional data mining techniques, as datasets are distributed and often heterogeneous. In this paper we propose a distributed approach based on the aggregation of models produced locally. The datasets will be processed locally on each node to produce clusters from local data then, we construct global clusters hierarchically. The aim of this approach is to minimise the communications, maximise the parallelism and load balance the work among different nodes of the system, and reduce the overhead due to extra processing while executing the hierarchical clustering. This technique is evaluated and compared to the sequential version using benchmark datasets and the results are very promising.

Data Mining Distributed Data Mining Clustering OPTICS

Souhila Ghanem Tahar Kechadi A.Kamel Tari

Department of Computer Science, University of Bejaia Targa Ouzemour, Bejaia,Algeria School of Computer Science and Informatics University College Dublin, Belfield,Dublin 04,Ireland

国际会议

2011 IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services(第一届空间数据挖掘与地理知识服务国际学术会议 ICSDM 2011)

福州

英文

60-65

2011-06-29(万方平台首次上网日期,不代表论文的发表时间)