MapReduce-based H-mine algorithm
Frequent Itemset Mining (FIM) is a very effective method for knowledge acquisition from data, but with the advent of the era of big data, traditional algorithms based on memory are facing severe challenges such as the computation speed and storage capacity. Fortunately, MapReduce model provides an efficient framework for distributed programming and operation framework. This paper proposes a novel MapReduce-based H-mine algorithm (MRH-mine), a version of H-mine algorithm in the distributed operation environment. Experimental results show that MRH-mine algorithm has a better performance and scalability than traditional H-Mine when facing massive data growth.
distributed data mining MapReduce H-mine parallelization
Xingjie Feng Jie Zhao Zhiyuan Zhang
computer science & technology CAUC Tian Jin,China
国际会议
秦皇岛
英文
1755-1760
2015-09-18(万方平台首次上网日期,不代表论文的发表时间)