TMaR: a two-stage MapReduce scheduler for heterogeneous environments

被引:8
|
作者
Maleki, Neda [1 ]
Faragardi, Hamid Reza [2 ]
Rahmani, Amir Masoud [3 ]
Conti, Mauro [4 ]
Lofstead, Jay [5 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Sci & Res Branch, Tehran, Iran
[2] KTH Royal Inst Technol, Dept Comp Sci & Commun, Stockholm, Sweden
[3] Khazar Univ, Dept Comp Sci, Baku, Azerbaijan
[4] Univ Padua, Dept Math, Padua, Italy
[5] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
关键词
MapReduce; Hadoop; Heterogeneous systems; Scheduling; Performance; Shuffling; Power; Cloud computing; LOCALITY-AWARE; MAKESPAN; ALGORITHMS; SYSTEMS; TIME;
D O I
10.1186/s13673-020-00247-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of MapReduce task scheduling, many algorithms mainly focus on the scheduling of Reduce tasks with the assumption that scheduling of Map tasks is already done. However, in the cloud deployments of MapReduce, the input data is located on remote storage which indicates the importance of the scheduling of Map tasks as well. In this paper, we propose a two-stage Map and Reduce task scheduler for heterogeneous environments, called TMaR. TMaR schedules Map and Reduce tasks on the servers that minimize the task finish time in each stage, respectively. We employ a dynamic partition binder for Reduce tasks in the Reduce stage to lighten the shuffling traffic. Indeed, TMaR minimizes the makespan of a batch of tasks in heterogeneous environments while considering the network traffic. The simulation results demonstrate that TMaR outperforms Hadoop-stock and Hadoop-A in terms of makespan and network traffic and achieves by an average of 29%, 36%, and 14% performance using Wordcount, Sort, and Grep benchmarks. Besides, the power reduction of TMaR is up to 12%.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] A two-stage regularization framework for heterogeneous event networks
    dos Santos, Brucce Neves
    Rossi, Rafael Geraldeli
    Rezende, Solange Oliveira
    Marcacini, Ricardo Marcondes
    PATTERN RECOGNITION LETTERS, 2020, 138 : 490 - 496
  • [22] A two-stage prediction model for heterogeneous effects of treatments
    Chalkou, Konstantina
    Steyerberg, Ewout
    Egger, Matthias
    Manca, Andrea
    Pellegrini, Fabio
    Salanti, Georgia
    STATISTICS IN MEDICINE, 2021, 40 (20) : 4362 - 4375
  • [23] Heterogeneous defect prediction with two-stage ensemble learning
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Zhang, Hongyu
    Xu, Baowen
    Ying, Shi
    AUTOMATED SOFTWARE ENGINEERING, 2019, 26 (03) : 599 - 651
  • [24] MrHeter: improving MapReduce performance in heterogeneous environments
    Zhang, Xiao
    Wu, Yanjun
    Zhao, Chen
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1691 - 1701
  • [25] MrHeter: improving MapReduce performance in heterogeneous environments
    Xiao Zhang
    Yanjun Wu
    Chen Zhao
    Cluster Computing, 2016, 19 : 1691 - 1701
  • [26] Enhancing Performance of MapReduce Framework in Heterogeneous Environments
    Naik, Nenavath Srinivas
    Negi, Atul
    Sastry, V. N.
    2015 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS (ADCOM), 2015, : 51 - 54
  • [27] Global dynamics of a two-stage structured diffusive population model in time-periodic and spatially heterogeneous environments
    Gueguezo, H. M.
    Doumate, T. J.
    Salako, R. B.
    STUDIES IN APPLIED MATHEMATICS, 2024,
  • [28] A Two-stage Multi-population Genetic Algorithm with Heuristics for Workflow Scheduling in Heterogeneous Distributed Computing Environments
    Xie, Yi
    Gui, Feng-Xian
    Wang, Wei-Jun
    Chien, Chen-Fu
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2023, 11 (02) : 1446 - 1460
  • [29] Design Dynamic Data Allocation Scheduler to Improve MapReduce Performance in Heterogeneous Clouds
    Yang, Shin-Jer
    Chen, Yi-Ru
    Hsieh, Yung-Ming
    2012 NINTH IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2012, : 265 - 270
  • [30] Burstiness-aware I/O Scheduler for MapReduce Framework on Virtualized Environments
    Kim, Sewoog
    Kang, Dongwoo
    Choi, Jongmoo
    Kim, Junmo
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 305 - 308