An Algorithm for Mining Frequent Stream Data Items Using Hash Function and Fading Factor
A new algorithm to mine the frequent items in data stream is presented. The algorithm adopts a time fading factor to emphasize the importance of the relatively newer data, and records the densities of the data items in Hash tables. For a given threshold of density S and an integer k, our algorithm can mine the top k frequent items. Computation time for processing each data item is O(1). Experimental results show that the algorithm outperforms other methods in terms of accuracy, memory requirement, and processing speed.
Stream data mining frequent data item fading factor Hash function
Qingling Mei Ling Chen
Department of Computer Science, Yangzhou University, Yangzhou China, 225009 Department of Computer Science, Yangzhou University, Yangzhou China, 225009 State Key Lab of Novel S
国际会议
合肥
英文
2661-2665
2011-09-23(万方平台首次上网日期,不代表论文的发表时间)