会议专题

Frequent Items Mining on Data Stream Using Hash-Table and Heap

Most of the existing algorithms for mining frequent items on data stream do not emphasis the importance of the recent data items. We present an algorithm to detect the items with frequency counts exceeding a user-specified threshold. Our algorithm uses a hash table L and a heap to record the potential frequent items, and can detect ε-approximate frequent data items on data stream using O(|L|+ ε-1) memory space and the processing time for each data item is O(logε1).Experimental results on several artificial and real datasets show our algorithm has higher precision, requires less memory and consumes less computation time than other similar methods.

data mining data stream frequent items time fading model hash table heap.

Zhang Shan Chen Ling Tu Li

Department of Computer Science Yang Zhou University Yangzhou,China Institute of Information Science and Technology Naming University of Aeronautics and Astronautiscs N

国际会议

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems(2009 IEEE 智能计算与智能系统国际会议)

上海

英文

141-145

2009-11-20(万方平台首次上网日期,不代表论文的发表时间)