A Parallel Algorithm for Frequent Itemset Mining
Frequent itemsets mining plays an essential role in data mining. In this paper, a new algorithm PFP-growth (Parallel FP-growth), which is based on the improved FP-growth, is proposed for parallel frequent itemset mining.The new algorithm distributes the task fairly among the parallel processors. We devise partitioning strategies at different stages of the mining process to achieve balance between processors and adopt some data structure to reduce the information transportation between processors. The experiments on national high performance parallel computer show that the PFP-growth is an efficient parallel algorithm for mining frequent itemset.
Parallel Frequent Itemset Data mining PFP-growth
Li Li Donghai Zhai Fan Jin
School of Computer and Communications Engineering Southwest Jiaotong University, Chengdu, 610031, P.R.CHINA
国际会议
成都
英文
868-871
2003-08-27(万方平台首次上网日期,不代表论文的发表时间)