An Efficient Algorithm for Finding Frequent Items in a Stream
Most of the existing algorithms for mining frequent items over data streams do not emphasis the importance of the more recent data items. We present an efficient algorithm where a fading factor λ is used for computing frequency counts exceeding a user-specified threshold over data streams. Our algorithm λ-Miner can detect e-approximate frequent items of a data stream using O(ε-1) memory space and the processing time for each data item is O(1). Experimental results on several artificial data sets and real data sets show that λ-Miner performs better than λ-LC in terms with precision, memory requirement and time cost.
data stream data mining frequent items fading factor
Li Tu Ling Chen Shan Zhang
Institute of Information Science and Technology, Department of Computer Science Nanjing University o Department of Computer Science, National Key Lab of Novel Software Tech Yangzhou University, Nanjing Department of Computer Science Yangzhou University Yangzhou, China
国际会议
Second International Symposium on Electronic Commerce and Security(第二届电子商务与安全国际研究大会)(ISECS 2009)
南昌
英文
856-860
2009-05-22(万方平台首次上网日期,不代表论文的发表时间)