An Optimal Task Selection Scheme for Hadoop Scheduling

被引:11
|
作者
Suresh, S. [1 ]
Gopalan, N. P. [1 ]
机构
[1] Natl Inst Technol, Dept Comp Applicat, Tiruchirappalli 632015, Tamil Nadu, India
关键词
Hadoop; MapReduce; task selection; job scheduling;
D O I
10.1016/j.ieri.2014.09.093
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce is a popular parallel programming model used to solve wide range of BigData applications in cloud computing environment. Hadoop is an open source implementation MapReduce and widely used by vast amount of users. It provides an abstracted environment for running large scale data intensive applications in a scalable and fault tolerant manner. There are several Hadoop scheduling algorithms are proposed in the literature with various performance goals. In this paper, a new optimal task selection scheme is introduced in to assist the scheduler when multiple local tasks are available for a node. To improve the probability of percentage of local tasks launched for a job in future, the task which has least number of replicas of input, individual load of disks attached to the node and maximum expected time to wait for next local node is launched among the available local tasks for a node. The proposed method was evaluated by extensive experiments and it has been observed that the method improves the performance significantly. From the experiments, around 20% of improvements achieved in terms of locality and fairness. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:70 / 75
页数:6
相关论文
共 50 条
  • [1] A Task Scheduling Algorithm for Hadoop Platform
    Chen, Jilan
    Wang, Dan
    Zhao, Wenbing
    [J]. JOURNAL OF COMPUTERS, 2013, 8 (04) : 929 - 936
  • [2] Research on Scheduling Scheme for Hadoop clusters
    Xie, Jiong
    Meng, FanJun
    Wang, HaiLong
    Pan, HongFang
    Cheng, JinHong
    Qin, Xiao
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 2468 - 2471
  • [3] Residual Traffic Based Task Scheduling in Hadoop
    Tanaka, Daichi
    Kawarasaki, Masatoshi
    [J]. CLOUD COMPUTING 2015: THE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, GRIDS, AND VIRTUALIZATION, 2015, : 94 - 102
  • [4] Hadoop Job Scheduling with Dynamic Task Splitting
    Xu, YongLiang
    Cai, Wentong
    [J]. 2015 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING RESEARCH AND INNOVATION (ICCCRI), 2015, : 120 - 129
  • [5] Implementation and Evaluation of The JobTracker Initiative Task Scheduling on Hadoop
    Yamazaki, Kazuki
    Kawashima, Ryota
    Saito, Shoichi
    Matsuo, Hiroshi
    [J]. 2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 622 - 626
  • [6] Load balancing task scheduling algorithm in Hadoop platform
    Cai Yandong
    Liu Yan
    Zhang Qinglei
    [J]. 2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 605 - 608
  • [7] RGV Optimal Scheduling Scheme Selection in Multi-Process Scenario
    Luo, Shuyu
    [J]. ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS III, 2019, 2073
  • [8] An improved task assignment scheme for Hadoop running in the clouds
    Dai, Wei
    Bassiouni, Mostafa
    [J]. JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2013, 2 (02): : 1 - 16
  • [9] HPCA: A Node Selection and Scheduling Method for Hadoop MapReduce
    Archana, G. K.
    Chakravarthy, V. Deeban
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATIONS TECHNOLOGIES (ICCCT 15), 2015, : 368 - 372
  • [10] A Dynamic and Failure-Aware Task Scheduling Framework for Hadoop
    Soualhia, Mbarka
    Khomh, Foutse
    Tahar, Sofiene
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 553 - 569