An Outlier Mining Algorithm Based on Confidence Interval
Outlier detection is a hot topic of data mining. After studying the existing classical algorithms of detecting outlier, this paper proposes an outlier mining algorithm based on confidence interval, and makes a new definition for outlier. The method combines mathematical statistics and densitybased clustering algorithm. It clustering firstly with DBSCAN algorithm, obtains credible sample and suspicious outliers. Secondly, a confidence interval is obtained based on credible sample, then suspicious outliers will be detected and disposed using the confidence interval. The experiment results on IRIS show that this algorithm can detect outliers effectively.
Outlier Clustering Confidence Interval Stratified sampling
Yue Zhang Xuehua Yang Hang Li
Software College Shenyang NormalUniversity Shenyang 110034, China Software College Shenyang Normal University Shenyang 110034, China
国际会议
成都
英文
1-4
2010-04-16(万方平台首次上网日期,不代表论文的发表时间)