会议专题

Study on The Outlier Detection Algorithm Based on Clustering Reduction

The computation complexity of the algorithm for identifying density-based local outliers (LOF algorithm) is not ideal,which affects its applications in large scale data sets. Under such circumstances,an outlier detection algorithm based on kernel k-means clustering was proposed,which applied kernel k-means clustering on data sets to calculate representation degrees (rpd ) of clusters. Those clustering data sets with high rdp were carried out for the candidates of outliers which were left for the following outlier mining using LOF algorithm. The outlier detection algorithm based on kernel k-means clustering reduced the computation for neighborhood of the data sets,and shortens the execution time. Result in both theoretic analyses and experiments show that this algorithm is effective and efficient.

data mining outlier detection Kernel K-means clustering representation degrees

Zhang Yuanfang Qin Liangxi

School of Computer,Electronics and Information,Guangxi University,Nanning 530004,China

国际会议

2010 International Forum on Computer Science-Technology and Applications(2010 国际计算机科学技术应用论坛 IFCSTA 2010)

南宁

英文

152-156

2010-12-10(万方平台首次上网日期,不代表论文的发表时间)