ONLINE DATA STREAM MINING OF RECENT FREQUENT ITEMSETS BASED ON SLIDING WINDOW MODEL
Online data stream mining is one of the most important issues in data mining. Identifying the recent knowledge can provide valuable information for the analysis of the data stream. In this paper, we proposed an one-pass data stream mining algorithm to mine the recent frequent itemsets in data streams with a sliding window basing on transactions. To reduce the cost of time and memory needed to slide the windows, each items is denoted a bit-sequence representations. Basing on Apriori property, this kind of representations can find frequent items in data streams efficiently. We named this method MRFI-SW (Mining Recent Frequent Itemsets by Sliding Window) algorithm. Experiment results show that the proposed algorithm not only attains highly accurate mining result, but also consumes less memory than existing algorithms for mining frequent itemsets over recent data streams.
Online data stream Data mining Sliding windows
JIA-DONG REN KE LI
College of Information Science and Engineering YanShan University, Qinhuangdao 066004, China
国际会议
2008 International Conference on Machine Learning and Cybernetics(2008机器学习与控制论国际会议)
昆明
英文
293-298
2008-07-12(万方平台首次上网日期,不代表论文的发表时间)