Self-Learning MapReduce Scheduler in Multi-job Environment

被引:6
|
作者
Lin, Changhang [1 ]
Guo, Wenzhong [1 ]
Lin, Changhui [2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92507 USA
关键词
Hadoop; heterogeneous environment; multi-job; speculative execution; straggler;
D O I
10.1109/CLOUDCOM-ASIA.2013.95
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hadoop, as the most widely adopted open-source implementation of MapReduce framework, makes MapReduce widely accessible. However, it is currently limited by its default MapReduce scheduler. To achieve better performance, the scheduler should take into consideration nodes' computing power and system resources in heterogeneous environment. Further more, from job perspective, tasks' non-linear progress is also an important factor. Some research work has been carried out to enhance the performance of MapReduce, but they are not satisfactory in terms of considering characteristics of both nodes and jobs. To overcome this drawback, we propose a Self-Learning MapReduce Scheduler (SLM), which outperforms the existing schedulers in multi-job environment. Since competitions on system resources may make a task's progress unpredictable, SLM determines the progress of each job based on its own historical information. In particular, on the self-learning stage of a job, with the feedback information from the first few tasks, SLM calculates the task phase weights. With these phase weights, SLM can obtain more accurate execution time estimation, which is the most important condition to finding stragglers (slow tasks). Experimental results show that, SLM can effectively improve the accuracy of execution time estimation and straggler identification, leading to the rational utilization of resources and shortening jobs' execution time especially in multi-job environment.
引用
收藏
页码:610 / 612
页数:3
相关论文
共 50 条
  • [1] Improving Multi-Job MapReduce Scheduling in an Opportunistic Environment
    Ji, Yuting
    Tong, Lang
    He, Ting
    Tan, Jian
    Lee, Kang-won
    Zhang, Li
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 9 - 16
  • [2] Octopus: A Multi-job Scheduler for Graphlab
    Padala, Srikant
    Kumar, Dinesh
    Raj, Arun
    Dharanipragada, Janakiram
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 293 - 298
  • [3] Job Classification for MapReduce Scheduler in Heterogeneous Environment
    Deshmukh, Shyam
    Aghav, J. V.
    Chakravarthy, Rohan
    [J]. 2013 INTERNATIONAL CONFERENCE ON CLOUD & UBIQUITOUS COMPUTING & EMERGING TECHNOLOGIES (CUBE 2013), 2013, : 26 - +
  • [4] PISCES: Optimizing Multi-Job Application Execution in MapReduce
    Chen, Qi
    Yao, Jinyu
    Li, Benchao
    Xiao, Zhen
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2019, 7 (01) : 273 - 286
  • [5] Efficient Device Scheduling with Multi-Job Federated Learning
    Zhou, Chendi
    Liu, Ji
    Jia, Juncheng
    Zhou, Jingbo
    Zhou, Yang
    Dai, Huaiyu
    Dou, Dejing
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9971 - 9979
  • [6] NEW DIMENSIONS IN SELF-LEARNING IN AN SELF-LEARNING ENVIRONMENT OF LEARNING IN REGSEAU
    Fournier, Helene
    Kop, Rita
    [J]. CANADIAN JOURNAL FOR THE STUDY OF ADULT EDUCATION, 2014, 26 (01): : 35 - 55
  • [7] Energy- and locality-efficient multi-job scheduling based on MapReduce for heterogeneous datacenter
    Lei Chen
    Zhao-Hua Liu
    [J]. Service Oriented Computing and Applications, 2019, 13 : 297 - 308
  • [8] Energy- and locality-efficient multi-job scheduling based on MapReduce for heterogeneous datacenter
    Chen, Lei
    Liu, Zhao-Hua
    [J]. SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2019, 13 (04) : 297 - 308
  • [9] Multi-Job Intelligent Scheduling With Cross-Device Federated Learning
    Liu, Ji
    Jia, Juncheng
    Ma, Beichen
    Zhou, Chendi
    Zhou, Jingbo
    Zhou, Yang
    Dai, Huaiyu
    Dou, Dejing
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (02) : 535 - 551
  • [10] DyScale: A MapReduce Job Scheduler for Heterogeneous Multicore Processors
    Yan, Feng
    Cherkasova, Ludmila
    Zhang, Zhuoyao
    Smirni, Evgenia
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (02) : 317 - 330