An Improved Parallel Association Rules Algorithm Based on MapReduce Framework for Big Data
Association rules mining is one of the most popular and significant issue in data mining and intends to discovery interest relations between variables in database.In our paper,we implemented an improved parallel Apriori algorithm which realized both count and candidate generation steps under MapReduce framework,while existing parallel Apriori algorithm only considered count step.We analyzed the time complexity of our improved parallel algorithm and compared to the original parallel algorithm,which indicates advantages of our algorithm with massive candidate item sets.Based on our experiment result,we proved that our algorithm performs better under big data situation and achieves excellent speedup feature.
Association Rules Apriori Hadoop MapReduce Data Mining
Xinhao Zhou Yongfeng Huang
Department of Electronic Engineering Tsinghua University Beijing,China
国际会议
厦门
英文
292-296
2014-08-19(万方平台首次上网日期,不代表论文的发表时间)