Analysis of Resource Usage Profile for MapReduce Applications Using Hadoop on Cloud
In this paper we present a study of resource consumption profiles for MapReduce applications using Hadoop on Amazon EC2.We selected three applications and measured their resource usage in terms of CPU and memory footprint.Specifically,we study Grep,Word Count and Sort applications while altering Hadoop’s configuration parameters corresponding to I/O buffer.Our study brings up 3 key points.Firstly,effect of I/O parameters on total running time of the application;secondly,invalid assumptions of Hadoop scheduler that three phases (copy,sort and reduce) of a Reduce task are equal;finally,an insight supported by the results from the experiments on ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications.To the best of our knowledge this is the first work that presents resource usage study.
map reduce resource usage hadoop scheduler
Zheyuan Liu Dejun Mu
Control and Network Institute Northwestern Polytechnical University Xian, China
国际会议
成都
英文
1506-1510
2012-06-15(万方平台首次上网日期,不代表论文的发表时间)