Mining Continuously Changing Data Streams

摘要：

A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Due to this reason, traditional data mining approach is replaced by systems that are able to mine continuous, high-volume, open-ended data streams as they arrive. Many papers have introduced new algorithms using clustering and classification methods to improve the mining of data streams. We have described the system used to build decision trees using constant memory and constant time per example. The cost effective techniques using output granularity algorithm is discussed. This algorithm is based on load shedding and rate based query optimization. The approach used in this algorithm is data rate adaptation. Data rate adaptation maximizes the output accuracy given available resources like memory, CPU utilization, and battery consumption etc. The data is generated rapidly and is continuously changing; therefore there are many issues that need to be taken care of by mining algorithms. In this paper we will discuss three such most widely used algorithms.

作者: LU Yi-hong WANG Zi-ren HUANG Yan

作者单位: College of Information Engineering, Zhejiang University of Technologeg Hangzhou, 310032, China University of North Texas, USA

会议类型: 国际会议

会议名称: 第七届国际测试技术研讨会

会议地点: 北京

会议语种:英文

在线出版日期: 2007-08-05（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Mining Continuously Changing Data Streams