Pipelined-MapReduce: An improved MapReduce Parallel programing model
MapReduce is a parallel programming model, and used to handle large datasets. The MapReduce program can be automatically concurrent executed in large-scale commodity machines. We proposed an improved MapReduce programming model—Pipelined-MapReduce, to solve the data intensive of information retrieval problems. Pipelined-MapReduce allows data transfer by pipeline between the operations, expanding the batched MapReduce programming model, and can reduce the completion time, and improve the system utilization rate. The experimental results demonstrate that the implemention of Pipelined-MapReduce can scale well and efficiently process large datasets on commodity machines.
MapReduce Hadoop Pipelined-MapReduce Parallel processing
Li Wang Zhiwei Ni Yiwen Zhang Zhang jun Wu Liyang Tang
Hefei University of Technology of school of Management, Hefei, Anhui, 230009, China Hefei University of Technology of school of Management, Hefei, Anhui, 230009, China Anhui University
国际会议
深圳
英文
871-874
2011-03-28(万方平台首次上网日期,不代表论文的发表时间)