Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies

被引:8
|
作者
Lin, Jia-Chun [1 ]
Leu, Fang-Yie [2 ]
Chen, Ying-Ping [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung 40704, Taiwan
来源
JOURNAL OF SUPERCOMPUTING | 2015年 / 71卷 / 05期
关键词
MapReduce; Hadoop; Job completion reliability; Job energy consumption; Intermediate data; Replication;
D O I
10.1007/s11227-014-1286-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, MapReduce has been a popular distributed programming framework for solving data-intensive applications. However, a large-scale MapReduce cluster has inevitable machine/node failures and considerable energy consumption. To solve these problems, MapReduce has employed several policies for replicating input data, storing/replicating intermediate data, and re-executing failed tasks. In this study, we concentrate on two typical policies for storing/replicating intermediate data, and derive the job completion reliability (JCR for short) and job energy consumption (JEC for short) of a MapReduce cluster when the two policies are individually employed. The two policies are further analyzed and compared given various scenarios in which jobs with different input data sizes, numbers of reduce tasks, and other parameters are run in a MapReduce cluster with two extreme parallel execution capabilities. From the analytical results, MapReduce managers are able to comprehend how the two policies influence the JCR and JEC of a MapReduce cluster.
引用
收藏
页码:1657 / 1677
页数:21
相关论文
共 5 条
  • [1] Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies
    Jia-Chun Lin
    Fang-Yie Leu
    Ying-Ping Chen
    The Journal of Supercomputing, 2015, 71 : 1657 - 1677
  • [2] Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption
    Lin, Jia-Chun
    Leu, Fang-Yie
    Chen, Ying-ping
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (05) : 1364 - 1378
  • [3] Analyzing job completion reliability and job energy consumption for a general MapReduce infrastructure
    Lin, Jia-Chun
    Leu, Fang-Yie
    Chen, Ying-ping
    JOURNAL OF HIGH SPEED NETWORKS, 2013, 19 (03) : 203 - 214
  • [4] Deriving Job Completion Reliability and Job Energy Consumption for a General MapReduce Infrastructure from Single-Job Perspective
    Lin, Jia-Chun
    Leu, Fang-Yie
    Lee, Ming-Chang
    Chen, Ying-ping
    2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 1642 - 1647
  • [5] Spatio-temporal thermal-aware job scheduling to minimize energy consumption in virtualized heterogeneous data centers
    Mukherjee, Tridib
    Banerjee, Ayan
    Varsamopoulos, Georgios
    Gupta, Sandeep K. S.
    Rungta, Sanjay
    COMPUTER NETWORKS, 2009, 53 (17) : 2888 - 2904