New Approach for Distributed Clustering
Nowadays the data collections are huge and in most cases do not reside in a centralised location. The latter complicates the task of traditional data mining techniques, as datasets are distributed and often heterogeneous. In this paper we propose a distributed approach based on the aggregation of models produced locally. The datasets will be processed locally on each node to produce clusters from local data then, we construct global clusters hierarchically. The aim of this approach is to minimise the communications, maximise the parallelism and load balance the work among different nodes of the system, and reduce the overhead due to extra processing while executing the hierarchical clustering. This technique is evaluated and compared to the sequential version using benchmark datasets and the results are very promising.
Data Mining Distributed Data Mining Clustering OPTICS
Souhila Ghanem Tahar Kechadi A.Kamel Tari
Department of Computer Science, University of Bejaia Targa Ouzemour, Bejaia,Algeria School of Computer Science and Informatics University College Dublin, Belfield,Dublin 04,Ireland
国际会议
福州
英文
60-65
2011-06-29(万方平台首次上网日期,不代表论文的发表时间)