Improving MapReduce Performance with Partial Speculative Execution

被引:21
|
作者
Wang, Yaoguang [1 ]
Lu, Weiming [1 ]
Lou, Renjie [1 ]
Wei, Baogang [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310003, Zhejiang, Peoples R China
关键词
Speculative execution; MapReduce performance; Straggler mitigation; SCHEDULING ALGORITHM;
D O I
10.1007/s10723-015-9350-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The MapReduce framework has become the de facto standard for big data processing due to its attractive features and abilities. One is that it automatically parallelizes a job into multiple tasks and transparently handles task execution on a large cluster of commodity machines. The increasing heterogeneity of distributed environments may result in a few straggling tasks, which prolong job completion. Speculative execution is proposed to mitigate stragglers. However, the existing speculative execution mechanism could not work efficiently as many speculative tasks are still slower than their original tasks. In this paper, we explore an approach to increase the efficiency of speculative execution, and further improve MapReduce performance. We propose the Partial Speculative Execution (PSE) strategy to make speculative tasks start from the checkpoint. By leveraging the checkpoint of original tasks, PSE can eliminate the costs of re-reading, re-copying, and re-computing the processed data. We implement PSE in Hadoop, and evaluate its performance in terms of job completion time and the efficiency of speculative execution under several kinds of classical workloads. Experimental results show that, in heterogeneous environments with stragglers, PSE completes jobs 56 % faster than that with no speculation and 12 % faster than that with LATE, an improved speculative execution algorithm. In addition, on average PSE can improve the efficiency of speculative execution by 24 % compared to LATE.
引用
收藏
页码:587 / 604
页数:18
相关论文
共 50 条
  • [21] Performance Optimization for Short MapReduce Job Execution in Hadoop
    Yan, Jinshuang
    Yang, Xiaoliang
    Gu, Rong
    Yuan, Chunfeng
    Huang, Yihua
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 688 - 694
  • [22] Towards Performance Modeling of Speculative Execution for Cloud Applications
    Nylander, Tommi
    Ruuskanen, Johan
    Arzen, Karl-Erik
    Maggio, Martina
    ICPE'20: COMPANION OF THE ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, 2020, : 17 - 19
  • [23] Improving a Genetic Algorithm for Route Planning Using Parallelism with Speculative Execution
    Mathias, H. David
    Foley, Samantha S.
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [24] Improving Encryption Performance using MapReduce
    Desai, Sanket
    Park, Younghee
    Gao, Jerry
    Chang, Sang-Yoon
    Song, Chungsik
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 1350 - 1355
  • [25] Elastic MapReduce Execution
    Goh, Wei Xiang
    Tan, Kian-Lee
    2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 216 - 225
  • [26] Improving the Execution Performance of FreeSurfer
    Delgado, J.
    Moure, J. C.
    Vives-Gilabert, Y.
    Delfino, M.
    Espinosa, A.
    Gomez-Anson, B.
    NEUROINFORMATICS, 2014, 12 (03) : 413 - 421
  • [27] SPECULARIZER: Detecting Speculative Execution Attacks via Performance Tracing
    Wang, Wubing
    Chen, Guoxing
    Cheng, Yueqiang
    Zhang, Yinqian
    Lin, Zhiqiang
    DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT, DIMVA 2021, 2021, 12756 : 151 - 172
  • [28] IMPROVING THE EFFICIENCY OF SPECULATIVE EXECUTION STRATEGY IN HADOOP USING AMAZON ELASTICACHE FOR REDIS
    Kavitha, C.
    Anita, X.
    Selvan, Shirley
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2021, 16 (06): : 4864 - 4878
  • [29] MrHeter: improving MapReduce performance in heterogeneous environments
    Zhang, Xiao
    Wu, Yanjun
    Zhao, Chen
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1691 - 1701
  • [30] MrHeter: improving MapReduce performance in heterogeneous environments
    Xiao Zhang
    Yanjun Wu
    Chen Zhao
    Cluster Computing, 2016, 19 : 1691 - 1701