EPPADS: An Enhanced Phase-Based Performance-Aware Dynamic Scheduler for High Job Execution Performance in Large Scale Clusters

被引:2
|
作者
Hamandawana, Prince [1 ]
Mativenga, Ronnie [1 ]
Kwon, Se Jin [2 ]
Chung, Tae-Sun [1 ]
机构
[1] Ajou Univ, Suwon 16499, South Korea
[2] Kangwon Natl Univ, Chunchon 24341, Gangwon, South Korea
基金
新加坡国家研究基金会;
关键词
Distributed processing; Scheduling; MapReduce; MAPREDUCE PERFORMANCE;
D O I
10.1007/978-3-030-18576-3_9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The way in which jobs are scheduled is critical to achieve high job processing performance in large scale data clusters. Most existing scheduling mechanism employs a First-In First-Out, serialized approach encompassed with task straggler hunting techniques which launches speculative tasks after detecting slow tasks. This is often achieved through the instrumentation of processing nodes. Such node instrumentation incurs frequent communication overheads as the number of processing nodes increase. Moreover the sequential scheduling of job tasks and the straggler hunting approach fails to meet optimal performance as they increase job waiting time in queue and incurs delayed speculative execution of straggling tasks respectively. In this paper we propose an Enhanced Phase based Performance Aware Dynamic Scheduler (EPPADS), which schedules job tasks without additional instrumentation modules. EPPADS uses a two staged scheduling approach, that is, the slow start phase (SSP) and accelerate phase (AccP). The SSP schedules the initial task in the queue in the normal FIFO way and records the initial execution times of the processing nodes. The AccP uses the initial execution times to compute the processing nodes task distribution ratio of the remaining tasks and schedules them using a single scheduling I/O. We implement EPPADS scheduler in Hadoop's MapReduce framework. Our evaluation shows that EPPADS can achieve a performance improvement on FIFO scheduler of 30%. Compared with existing Dynamic scheduling approach which uses node instrumentation, EPPADS achieves a better performance of 22%.
引用
收藏
页码:140 / 156
页数:17
相关论文
共 50 条
  • [1] Performance-Aware Speculative Resource Oversubscription for Large-Scale Clusters
    Yang, Renyu
    Hu, Chunming
    Sun, Xiaoyang
    Garraghan, Peter
    Wo, Tianyu
    Wen, Zhenyu
    Peng, Hao
    Xu, Jie
    Li, Chao
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (07) : 1499 - 1517
  • [2] Towards an Energy Efficient Computing With Coordinated Performance-Aware Scheduling in Large Scale Data Clusters
    Hamandawana, Prince
    Mativenga, Ronnie
    Kwon, Se Jin
    Chung, Tae-Sun
    [J]. IEEE ACCESS, 2019, 7 : 140261 - 140277
  • [3] Energy-aware job scheduler for high-performance computing
    Mammela, Olli
    Majanen, Mikko
    Basmadjian, Robert
    De Meer, Hermann
    Giesler, Andre
    Homberg, Willi
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2012, 27 (04): : 265 - 275
  • [4] Performance-aware Malware Epidemic Confinement in Large-Scale IoT Networks
    Hassan, Rakibul
    Rafatirad, Setareh
    Homayoun, Houman
    Dinakarrao, Sai Manoj Pudukotai
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [5] A Network-aware Scheduler in Data-parallel Clusters for High Performance
    Li, Zhuozhao
    Shen, Haiying
    Sarker, Ankur
    [J]. 2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 1 - 10
  • [6] A performance-aware dynamic scheduling algorithm for cloud-based IoT applications
    Pandiyan, Sanjeevi
    Lawrence, T. Samraj
    Sathiyamoorthi, V
    Ramasamy, Manikandan
    Xia, Qian
    Guo, Ya
    [J]. COMPUTER COMMUNICATIONS, 2020, 160 : 512 - 520
  • [7] Performance-aware Lightweight Dynamic Early-Exit-based Gait Authentication
    Zouridakis, Pavlos
    Dinakarrao, Sai Manoj Pudukotai
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 466 - 470
  • [8] PADS: Performance-Aware Dynamic Scheduling for effective MapReduce Computation in Heterogeneous Clusters Poster extended abstract
    Hamandawana, Prince
    Mativenga, Ronnie
    Kwon, Se Jin
    Chung, Tae-Sun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 160 - 161
  • [9] FEASIBILITY ANALYSIS OF USING THE MAUI SCHEDULER FOR JOB SIMULATION OF LARGE-SCALE PBS BASED CLUSTERS
    Zitzlsberger, Georg
    Jansik, Branislav
    Martinovic, Jan
    [J]. IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2018, 13 (02): : 47 - 61
  • [10] Thermal-aware Job Scheduling of MapReduce Applications on High Performance Clusters
    Taneja, Shubbhi
    Zhou, Yi
    Alghamdi, Mohammed I.
    Qin, Xiao
    [J]. 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW), 2017, : 261 - 270