Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption

被引:8
|
作者
Lin, Jia-Chun [1 ]
Leu, Fang-Yie [2 ]
Chen, Ying-ping [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung, Taiwan
关键词
Intermediate-data replication; job completion reliability; job energy consumption; MapReduce;
D O I
10.1109/TPDS.2014.2374600
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, MapReduce has been widely employed by many companies/organizations to tackle data-intensive problems over a large-scale MapReduce cluster. To solve machine/node failure which is inevitable in a MapReduce cluster, MapReduce employs several policies, such as input-data replication and intermediate-data replication policies. To speed up job execution, MapReduce allows reduce tasks to early fetch their required intermediate data. However, the impact of these policy combinations on the job completion reliability (JCR for short) and job energy consumption (JEC for short) of a MapReduce cluster was not clear, where JCR is the reliability with which a MapReduce job can be completed by the cluster, whereas JEC is the energy consumed by the cluster to complete the job. Therefore, in this study, we analyze the JCR and JEC of a MapReduce cluster on four policy combinations (POCs for short) derived from two typical intermediate-data replication policies and two typical reduce-task assignment policies. The four POCs are further compared in extensive scenarios, which not only consider jobs at different scales with various parameters, but also give a MapReduce cluster two extreme parallel execution capabilities and diverse bandwidths. The analytical results enable MapReduce managers to comprehend how these POCs impact the JCR and JEC of a cluster and then select an appropriate POC based on the characteristics of their own MapReduce jobs and clusters.
引用
收藏
页码:1364 / 1378
页数:15
相关论文
共 50 条
  • [41] Dynamic Job Ordering and Slot Configurations for MapReduce Workloads
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (01) : 4 - 17
  • [42] Performance optimization for short job execution in Hadoop MapReduce
    Gu, Rong
    Yan, Jinshuang
    Yang, Xiaoliang
    Yuan, Chunfeng
    Huang, Yihua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (06): : 1270 - 1280
  • [43] An Investigation into Minimising Total Energy Consumption and Total Completion Time in a Flexible Job Shop for Recycling Carbon Fiber Reinforced Polymer
    Liu, Ying
    Tiwari, Ashutosh
    22ND CIRP CONFERENCE ON LIFE CYCLE ENGINEERING, 2015, 29 : 722 - 727
  • [44] DyScale: A MapReduce Job Scheduler for Heterogeneous Multicore Processors
    Yan, Feng
    Cherkasova, Ludmila
    Zhang, Zhuoyao
    Smirni, Evgenia
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (02) : 317 - 330
  • [45] Hadoop-MapReduce Job Scheduling Algorithms Survey
    Mohamed, Ehab
    Hong, Zheng
    2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 237 - 242
  • [46] MapReduce Job Performance Tuning by Optimizing Memory Configurations
    Luo Y.-G.
    Chen X.-S.
    Yang L.
    1600, South China University of Technology (45): : 102 - 111
  • [47] Performance Optimization for Short MapReduce Job Execution in Hadoop
    Yan, Jinshuang
    Yang, Xiaoliang
    Gu, Rong
    Yuan, Chunfeng
    Huang, Yihua
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 688 - 694
  • [48] The Reliability of Job Evaluation Rankings
    Ash, Philip
    JOURNAL OF APPLIED PSYCHOLOGY, 1948, 32 (03) : 313 - 320
  • [49] Psychophysiological criteria of job reliability
    Bessonova, Y. V.
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2012, 85 (03) : 372 - 372
  • [50] Job loss expectations and consumption
    不详
    MONTHLY LABOR REVIEW, 2003, 126 (03) : 27 - 27