会议专题

Analysis of Resource Usage Profile for MapReduce Applications Using Hadoop on Cloud

  In this paper we present a study of resource consumption profiles for MapReduce applications using Hadoop on Amazon EC2.We selected three applications and measured their resource usage in terms of CPU and memory footprint.Specifically,we study Grep,Word Count and Sort applications while altering Hadoop’s configuration parameters corresponding to I/O buffer.Our study brings up 3 key points.Firstly,effect of I/O parameters on total running time of the application;secondly,invalid assumptions of Hadoop scheduler that three phases (copy,sort and reduce) of a Reduce task are equal;finally,an insight supported by the results from the experiments on ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications.To the best of our knowledge this is the first work that presents resource usage study.

map reduce resource usage hadoop scheduler

Zheyuan Liu Dejun Mu

Control and Network Institute Northwestern Polytechnical University Xian, China

国际会议

2012 International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering & The 3rd International Conference on Maintenance Engineering (2012质量,可靠性,风险,维修性及安全性工程国际会议(QR2MSE 2012 & ICME 2012))

成都

英文

1506-1510

2012-06-15(万方平台首次上网日期,不代表论文的发表时间)