Analysis of Resource Usage Profile for MapReduce Applications Using Hadoop on Cloud

被引:0
|
作者
Liu, Zheyuan [1 ]
Mu, Dejun [1 ]
机构
[1] Northwestern Polytech Univ, Control & Network Inst, Xian 710072, Peoples R China
关键词
map reduce; resource usage; hadoop scheduler;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this paper we present a study of resource consumption profiles for MapReduce applications using Hadoop on Amazon EC2. We selected three applications and measured their resource usage in terms of CPU and memory footprint. Specifically, we study Grep, Word Count and Sort applications while altering Hadoop's configuration parameters corresponding to I/O buffer. Our study brings up 3 key points. Firstly, effect of I/O parameters on total running time of the application; secondly, invalid assumptions of Hadoop scheduler that three phases (copy, sort and reduce) of a Reduce task are equal; finally, an insight supported by the results from the experiments on ways to improve the Hadoop scheduler for running multiple jobs by capturing the resource consumption information of different applications. To the best of our knowledge this is the first work that presents resource usage study.
引用
收藏
页码:1500 / 1504
页数:5
相关论文
共 50 条
  • [1] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [2] MapReduce Based Analysis of Sample Applications Using Hadoop
    Ghazi, Mohd Rehan
    Raghava, N. S.
    [J]. APPLICATIONS OF COMPUTING AND COMMUNICATION TECHNOLOGIES, ICACCT 2018, 2018, 899 : 34 - 44
  • [3] A comparison of forecasting models for the resource usage of MapReduce applications
    Li, Yang Yuan
    Tien Van Do
    Nguyen, Hai T.
    [J]. NEUROCOMPUTING, 2020, 418 : 36 - 55
  • [4] Model Driven Performance Simulation of Cloud Provisioned Hadoop MapReduce Applications
    Alipour, Hanieh
    Liu, Yan
    Hamou-Lhadj, Abdelwahab
    Gorton, Ian
    [J]. 2016 IEEE/ACM 8TH INTERNATIONAL WORKSHOP ON MODELING IN SOFTWARE ENGINEERING (MISE), 2016, : 48 - 54
  • [5] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Humphrey, Marty
    [J]. 2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 74 - 82
  • [6] Data Analysis using Hadoop MapReduce Environment
    Merla, PrathyushaRani
    Liang, Yiheng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4783 - 4785
  • [7] Toward Optimal Resource Provisioning for Cloud MapReduce and Hybrid Cloud Applications
    Ruiz-Alvarez, Arkaitz
    Kim, In Kee
    Humphrey, Marty
    [J]. 2015 IEEE 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, 2015, : 669 - 677
  • [8] Performance Analysis of Hadoop MapReduce on an OpenNebula Cloud with KVM and OpenVZ Virtualizations
    Magalhaes Vasconcelos, Pedro Roger
    de Araujo Freitas, Gisele Azevedo
    [J]. 2014 9TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2014, : 471 - 476
  • [9] Using Coq in specification and program extraction of Hadoop MapReduce applications
    Ono, Kosuke
    Hirai, Yoichi
    Tanabe, Yoshinori
    Noda, Natsuko
    Hagiya, Masami
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 7041 LNCS : 350 - 365
  • [10] Analysis of Virtualized Congestion Control in Applications Based on Hadoop MapReduce
    Moro, Vilson
    Pillon, Mauricio Aronne
    Miers, Charles Christian
    Koslovski, Guilherme Piegas
    [J]. HIGH PERFORMANCE COMPUTING SYSTEMS, WSCAD 2018, 2020, 1171 : 37 - 52