会议专题

A FAST ALGORITHM FOR CONSTRUCTING FP_TREE

Recently, most of the studies on mining frequent patterns focus on improving the efficiency of frequent itemtsets generations, but the I/O cost of database scanning has been a bottle-neck problem in data mining.Many algorithms proposed recently are based on A priori and FP_tree, and the FP_growth algorithm based on FPtree is more efficient than Apriori because the candidates are not generated.But the construction of FP_tree may spend much time.Therefore, the goal of our research is to propose a fast algorithm.In this paper.Level FP_tree that is constructed level by level (abbreviate LFP_tree) is proposed.The algorithm contains two main parts.The first is to scan the database only once for generating equivalence classes of each item.The second is to delete the non-frequent items and rewrite the equivalence classes of the frequent items, and then construct the LFP_tree.Experimental results have proved that LFP_tree is more efficient and scalable than FP_tree.

Frequent pattern FP_growth FP_tree LFP_tree Equivalence class

JIAO-MIN LIU SHENG GUO ZHEN-ZHOU WANG

College of Information Science and Engineering, Hebei University of Science and Technology, Shijiazh College of Information Science and Engineering, Hebei University of Science and Technology, Shijiazh College of Electrical Engineering, North China Electric Power University, Baoding 071003, Hebei, Chi

国际会议

2007 International Conference on Machine Learning and Cybernetics(IEEE第六届机器学习与控制论国际会议)

香港

英文

2390-2394

2007-08-19(万方平台首次上网日期,不代表论文的发表时间)