Building quotient cube with MapReduce in hadoop

摘要：

　　In order to solve the problem that how to improve the efficiency of query and calculation in massive data,a method of building quotient cubes in Hadoop plateform which combined the advantage of the quotient cube and MapReduce model is proposed in this paper.At first,all cubes will be established and their aggregate value will be calculated in the Mapping stage.All the key/value pair formed in Mapping stage will be passed to Reducing stage.Equivalence partitioning will be carried out In this stage,and the minimum aggregation cube of each equivalence partitioning will be the key with its aggregate value.According to the minimum aggregation cubes,we can get the quotient cubes.In order to improve the speed of parallel computing and reduce network traffic,equivalence class division will be executed locally after the Map stage,it is named as combiner stage.In this paper,MapReduce model is used to improve the efficiency of building quotient cube because of its ability of parallel computing in a large amount of data.In addition,the experiment proved that,under certain circumstances,increasing the number of Mapper/Reducer task can reduce the building time effectively,and improve the construction efficiency.

关键词： Data warehousing, Quotient cube, Hadoop plateform, MapReduce model cloud computing Datas

作者: Juan Zhang

作者单位: Department of Computer Science and Technology, East China Normal University, Shanghai 200241, China

会议类型: 国际会议

会议名称: 2013 2nd International Symposium on Computer,Communication,Control and Automation(ISCCCA-13)(2013年第二届计算机、通信与自动化国际会议)

会议地点: 太原

会议语种:英文

页码: 643-646

在线出版日期: 2013-04-06（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Building quotient cube with MapReduce in hadoop