会议专题

An Outlier Mining Algorithm Based on Confidence Interval

Outlier detection is a hot topic of data mining. After studying the existing classical algorithms of detecting outlier, this paper proposes an outlier mining algorithm based on confidence interval, and makes a new definition for outlier. The method combines mathematical statistics and densitybased clustering algorithm. It clustering firstly with DBSCAN algorithm, obtains credible sample and suspicious outliers. Secondly, a confidence interval is obtained based on credible sample, then suspicious outliers will be detected and disposed using the confidence interval. The experiment results on IRIS show that this algorithm can detect outliers effectively.

Outlier Clustering Confidence Interval Stratified sampling

Yue Zhang Xuehua Yang Hang Li

Software College Shenyang NormalUniversity Shenyang 110034, China Software College Shenyang Normal University Shenyang 110034, China

国际会议

2010 2nd IEEE International Conference on Information Management and Engineering(2010年IEEE第二届信息管理与工程国际会议 IEEE ICIME 2010)

成都

英文

1-4

2010-04-16(万方平台首次上网日期,不代表论文的发表时间)