rTuner: A Performance Enhancement of MapReduce Job

被引:1
|
作者
Patgiri, Ripon [1 ]
Das, Rajdeep [1 ]
机构
[1] Natl Inst Technol Silchar, Silchar 788010, Assam, India
关键词
rTuner; MapReduce; Scheduler; Straggler Task; Task Scheduler; Enhancement; Hadoop; Reduce Task; LATE;
D O I
10.1145/3177457.3191710
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present a novel task scheduling algorithm, called rTuner. The key objective of the rTuner is to enhance the reduce task execution time in heterogeneous environments. Because, the reduce task is a very expensive process. The reduce tasks comprise of three phases, unlike to the map task, namely, copy phase, shuffle phase, and reduce phase. Therefore, the rescheduling a straggler reduce task can negatively affect the performance, if the scheduling algorithms does not analyze the underlying situation. The rTuner analyzes the reduce tasks' straggling reason, and tunes the reduce task. If a reduce task becomes straggler, then rTuner reschedules it in a suitable node depending on the situation. Our benchmark result shows that enhancement of reduce tasks boosts up the CPU elapsed time significantly. Moreover, we show the efficacy of the rTuner by extensive experiment in low-cost commodity hardware. The rTuner is able to improve the total job execution time of MapReduce significantly, either a heterogeneous environment or homogeneous environment. The rTuner is capable of reducing the execution time by 86.86 seconds and 100.67 seconds on an average over the Longest Approximate Time to End (LATE) in homogeneous and heterogeneous environment respectively. In addition, the rTuner is also able to improve the execution time by 142.44 seconds and 132.52 seconds over LATE in homogeneous and heterogeneous environment at the best situation respectively.
引用
收藏
页码:176 / 183
页数:8
相关论文
共 50 条
  • [31] Analysis of Job Scheduling Algorithms and Studying Dynamic Job Ordering to Optimize MapReduce
    Mohammed, Ahmed Qasim
    Bharati, Rajesh
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS, ICICA 2016, 2018, 632 : 343 - 352
  • [32] Dynamic Job Ordering and Slot Configurations for MapReduce Workloads
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (01) : 4 - 17
  • [33] Predicting Job Completion Time In Heterogeneous MapReduce Environments
    Singhal, Rekha
    Verma, Abhishek
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 17 - 27
  • [34] Minimizing total job completion time in MapReduce scheduling
    Dong, Jianming
    Goebel, Randy
    Hu, Jueliang
    Lin, Guohui
    Su, Bing
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 158
  • [35] Hadoop-MapReduce Job Scheduling Algorithms Survey
    Mohamed, Ehab
    Hong, Zheng
    2016 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2016, : 237 - 242
  • [36] DyScale: A MapReduce Job Scheduler for Heterogeneous Multicore Processors
    Yan, Feng
    Cherkasova, Ludmila
    Zhang, Zhuoyao
    Smirni, Evgenia
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (02) : 317 - 330
  • [37] Minimizing total job completion time in MapReduce scheduling
    Dong, Jianming
    Goebel, Randy
    Hu, Jueliang
    Lin, Guohui
    Su, Bing
    Computers and Industrial Engineering, 2021, 158
  • [38] Enhancement of Xen's Scheduler for MapReduce Workloads
    Kang, Hui
    Chen, Yao
    Wong, Jennifer L.
    Sion, Radu
    Wu, Jason
    HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 251 - 262
  • [39] AN ANALYTICAL PERFORMANCE MODEL OF MAPREDUCE
    Yang, Xiao
    Sun, Jianling
    2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 306 - 310
  • [40] On the Performance of MapReduce: A Stochastic Approach
    Ahmed, Sarker Tanzir
    Loguinov, Dmitri
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, : 49 - 54