Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption

被引:8
|
作者
Lin, Jia-Chun [1 ]
Leu, Fang-Yie [2 ]
Chen, Ying-ping [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung, Taiwan
关键词
Intermediate-data replication; job completion reliability; job energy consumption; MapReduce;
D O I
10.1109/TPDS.2014.2374600
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, MapReduce has been widely employed by many companies/organizations to tackle data-intensive problems over a large-scale MapReduce cluster. To solve machine/node failure which is inevitable in a MapReduce cluster, MapReduce employs several policies, such as input-data replication and intermediate-data replication policies. To speed up job execution, MapReduce allows reduce tasks to early fetch their required intermediate data. However, the impact of these policy combinations on the job completion reliability (JCR for short) and job energy consumption (JEC for short) of a MapReduce cluster was not clear, where JCR is the reliability with which a MapReduce job can be completed by the cluster, whereas JEC is the energy consumed by the cluster to complete the job. Therefore, in this study, we analyze the JCR and JEC of a MapReduce cluster on four policy combinations (POCs for short) derived from two typical intermediate-data replication policies and two typical reduce-task assignment policies. The four POCs are further compared in extensive scenarios, which not only consider jobs at different scales with various parameters, but also give a MapReduce cluster two extreme parallel execution capabilities and diverse bandwidths. The analytical results enable MapReduce managers to comprehend how these POCs impact the JCR and JEC of a cluster and then select an appropriate POC based on the characteristics of their own MapReduce jobs and clusters.
引用
收藏
页码:1364 / 1378
页数:15
相关论文
共 50 条
  • [1] Analyzing job completion reliability and job energy consumption for a general MapReduce infrastructure
    Lin, Jia-Chun
    Leu, Fang-Yie
    Chen, Ying-ping
    JOURNAL OF HIGH SPEED NETWORKS, 2013, 19 (03) : 203 - 214
  • [2] Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies
    Jia-Chun Lin
    Fang-Yie Leu
    Ying-Ping Chen
    The Journal of Supercomputing, 2015, 71 : 1657 - 1677
  • [3] Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies
    Lin, Jia-Chun
    Leu, Fang-Yie
    Chen, Ying-Ping
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (05): : 1657 - 1677
  • [4] Deriving Job Completion Reliability and Job Energy Consumption for a General MapReduce Infrastructure from Single-Job Perspective
    Lin, Jia-Chun
    Leu, Fang-Yie
    Lee, Ming-Chang
    Chen, Ying-ping
    2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 1642 - 1647
  • [5] Impact of MapReduce Task Re-execution Policy on Job Completion Reliability and Job Completion Time
    Lin, Jia-Chun
    Leu, Fang-Yie
    Chen, Ying-ping
    Munawar, Waqaas
    2014 IEEE 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2014, : 712 - 718
  • [6] Predicting Job Completion Time In Heterogeneous MapReduce Environments
    Singhal, Rekha
    Verma, Abhishek
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 17 - 27
  • [7] Minimizing total job completion time in MapReduce scheduling
    Dong, Jianming
    Goebel, Randy
    Hu, Jueliang
    Lin, Guohui
    Su, Bing
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 158
  • [8] Minimizing total job completion time in MapReduce scheduling
    Dong, Jianming
    Goebel, Randy
    Hu, Jueliang
    Lin, Guohui
    Su, Bing
    Computers and Industrial Engineering, 2021, 158
  • [9] MapReduce Job Scheduling Based on Remaining Job Sizes
    Matsuki, Tatsuma
    Takine, Tetsuya
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2015, E98B (01) : 180 - 189
  • [10] Curtailing job completion time in MapReduce clouds through improved Virtual Machine allocation
    Shabeera, T. P.
    Kumar, S. D. Madhu
    Chandran, Priya
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 58 : 190 - 202