Ensemble based Data Stream Mining with Recalling and Forgetting Mechanisms
Using ensemble of classifiers on sequential chunks of training instances is a popular strategy for data stream mining.Aiming at the limitations of the existing approaches,we introduce recalling and forgetting mechanisms into ensemble based data stream mining,and put forward a new algorithm MAE(Memorizing based Adaptive Ensemble)to mine chunk-based data streams with concept drifts.Ensemble pruning is used as a recalling mechanism to select useful component classifiers for each incoming data chunk.Ebbinghaus forgetting curve is adopted as a forgetting mechanism to evaluate and replace the component classifiers in the memory repository.Experiments have been performed on datasets with different types of concept drifts.Compared with traditional ensemble approaches,the results show that MAE is a good algorithm with high and stable accuracy,less predicting time and moderate training time.
data stream mining Ebbinghaus forgetting curve recalling mechanism ensemble pruning
Yanhuang Jiang Qiangli Zhao Yutong Lu
State Key Laboratory of High Performance Computing,National University of Defense Technology,Changsh School of Computer and Information Engineering,Hunan University of Commerce,Changsha,410205,China
国际会议
厦门
英文
439-444
2014-08-19(万方平台首次上网日期,不代表论文的发表时间)