Optimized scheduling of multi-user Map-Reduce jobs in heterogeneous environment

被引:2
|
作者
Varalakshmi, Perumal [1 ]
Subbiah, Sankari [2 ]
机构
[1] Anna Univ, Dept Comp Technol, Madras Inst Technol, Chennai, Tamil Nadu, India
[2] Sri Sai Ram Engn Coll, Dept Informat Technol, Chennai, Tamil Nadu, India
来源
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2022年 / 34卷 / 27期
关键词
divisible load theory; makespan; MapReduce; predictive partitioning; two level successive partitioning; TREE NETWORKS; MAPREDUCE; STRATEGIES;
D O I
10.1002/cpe.7316
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Map-Reduce is a programming paradigm widely used for data intensive applications in distributed computing. Divisible load theory (DLT) is another model to partition divisible load for parallel processing. In this work, a new job scheduler named virtual job scheduler (VJS) is developed to schedule the MapReduce jobs based on DLT. VJS constructs a virtual job set from the queue of jobs awaiting execution by considering the CPU and IO resource utilization levels of each job. The core of VJS is in its partitioning algorithm. Two novel partitioning algorithms, namely, two level successive partitioning (TLSP) and predictive partitioning (PRED) have been proposed. TLSP applies DLT to both map and reduce phase in succession. This second load partitioning performed during the comparatively longer reduce phase, exploits the advantage of DLT better. PRED is a modification of TLSP aimed at optimizing the overall schedule length, by taking the idle time of the reducers into consideration. Evaluation of these two models across various execution environments has indicated a profound decrease in the wait time of reducers and as a result has shown a significant reduction in the makespan of the whole job as such. VJS altered to incorporate PRED partitioning algorithm has proved to be suitable for heterogeneous environment, with high resource utilization compared with the default Hadoop MapReduce scheduler.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A List Scheduling Algorithm for Scheduling Multi-user Jobs on Clusters
    Barbosa, J.
    Monteiro, A. P.
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2008, 2008, 5336 : 123 - +
  • [2] SIMULATION-BASED MANPOWER PLANNING WITH OPTIMIZED SCHEDULING IN A DISTRIBUTED MULTI-USER ENVIRONMENT
    Kalasky, David
    Coffman, Michael
    De Grano, Melanie
    Field, Kevin
    PROCEEDINGS OF THE 2010 WINTER SIMULATION CONFERENCE, 2010, : 3447 - 3459
  • [3] On Scheduling in Map-Reduce and Flow-Shops
    Moseley, Benjamin
    Dasgupta, Anirban
    Kumar, Ravi
    Sarlos, Tamas
    SPAA 11: PROCEEDINGS OF THE TWENTY-THIRD ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2011, : 289 - 298
  • [4] Scheduling resources in multi-user, heterogeneous, computing environments with SmartNet
    Freund, RF
    Gherrity, M
    Ambrosius, S
    Campbell, M
    Halderman, M
    Hensgen, D
    Keith, E
    Kidd, T
    Kussow, M
    Lima, JD
    Mirabile, F
    Moore, L
    Rust, B
    Siegel, HJ
    SEVENTH HETEROGENEOUS COMPUTING WORKSHOP (HCW '98), 1998, : 184 - 199
  • [5] Optimizing Multiway Joins in a Map-Reduce Environment
    Afrati, Foto N.
    Ullman, Jeffrey D.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (09) : 1282 - 1298
  • [6] Scheduling of parallel jobs in a heterogeneous multi-site environment
    Sabin, G
    Kettimuthu, R
    Rajan, A
    Sadayappan, P
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2003, 2862 : 87 - 104
  • [7] The Map-Reduce Parallelism Framework for Task Scheduling in Grid Computing
    Pei, Yunxia
    Zhang, Yue
    OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 111 - +
  • [8] Performance evaluation for multi-user environment by NOMA and beam-forming with user scheduling
    Suzuki, Motoaki
    Nishimori, Kentaro
    Makino, Hideo
    2016 INTERNATIONAL SYMPOSIUM ON ANTENNAS AND PROPAGATION (ISAP), 2016, : 498 - 499
  • [9] An Optimized Method of Translating SQL to More Efficient Map-reduce Tasks
    Cao, Jin
    Han, Honglin
    Zhao, Mingming
    Ye, Sijing
    Zhu, Dehai
    Li, Lin
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2015, 8 (04): : 249 - 256
  • [10] Efficient Data Layouts for Cost-Optimized Map-Reduce Operations
    Kaur, Narinder
    Taruna, S.
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 600 - 604