A Learning-based MapReduce Scheduler in Heterogeneous Environments

被引:0
|
作者
Naik, Nenavath Srinivas [1 ,2 ]
Negi, Atul [1 ]
机构
[1] Univ Hyderabad, Sch Comp & Informat Sci, Hyderabad, Andhra Pradesh, India
[2] CVR Coll Engn, Dept Comp Sci & Engn, Hyderabad, Andhra Pradesh, India
关键词
MapReduce; Reinforcement learning; Data Locality; Stragglers; Heterogeneous environment; LOCALITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce is an essential framework for distributed storage and parallel processing for large-scale data intensive jobs proposed in recent times. Hadoop default scheduler assumes a homogeneous environment. This assumption of homogeneity does not work at all times in practice and limits the performance of MapReduce. In heterogeneous environments, the job completion times do not synchronize. Data locality is essentially moving computation closer (faster access) to the input data. Fundamentally, MapReduce does not always look into the heterogeneity from a data locality perspective. Improving data locality for MapReduce framework is an important issue to enhance the performance of heterogeneous Hadoop clusters. Learning based scheduling decisions can potentially help in significantly reducing the overall job execution time. In this paper, we provide an overview of the taxonomy for MapReduce schedulers. This paper proposes a novel hybrid scheduler using a Reinforcement learning based approach. The proposed scheduler identifies the true Straggler tasks and schedules these tasks on fast processing nodes in a heterogeneous Hadoop cluster by taking the data locality into account.
引用
收藏
页码:2020 / 2025
页数:6
相关论文
共 50 条
  • [1] A data locality based scheduler to enhance MapReduce performance in heterogeneous environments
    Naik, Nenavath Srinivas
    Negi, Atul
    Bapu, Tapas B. R.
    Anitha, R.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 90 : 423 - 434
  • [2] TMaR: a two-stage MapReduce scheduler for heterogeneous environments
    Maleki, Neda
    Faragardi, Hamid Reza
    Rahmani, Amir Masoud
    Conti, Mauro
    Lofstead, Jay
    [J]. HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2020, 10 (01)
  • [3] A Usage-Aware Scheduler for Improving MapReduce Performance in Heterogeneous Environments
    Hsiao, J. H.
    Kao, S. J.
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1647 - +
  • [4] Enhancing the Performance of MapReduce Default Scheduler by Detecting Prolonged TaskTrackers in Heterogeneous Environments
    Naik, Nenavath Srinivas
    Negi, Atul
    Sastry, V. N.
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 2, 2016, 380 : 225 - 233
  • [5] A Dynamic MapReduce Scheduler for Heterogeneous Workloads
    Tian, Chao
    Zhou, Haojie
    He, Yongqiang
    Zha, Li
    [J]. 2009 EIGHTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2009, : 218 - 224
  • [6] Improving MapReduce scheduler for heterogeneous workloads in a heterogeneous environment
    Jeyaraj, Rathinaraja
    Ananthanarayana, V. S.
    Paul, Anand
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [7] Improving MapReduce scheduler for heterogeneous workloads in a heterogeneous environment
    Jeyaraj, Rathinaraja
    Ananthanarayana, V. S.
    Paul, Anand
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (17):
  • [8] Job Classification for MapReduce Scheduler in Heterogeneous Environment
    Deshmukh, Shyam
    Aghav, J. V.
    Chakravarthy, Rohan
    [J]. 2013 INTERNATIONAL CONFERENCE ON CLOUD & UBIQUITOUS COMPUTING & EMERGING TECHNOLOGIES (CUBE 2013), 2013, : 26 - +
  • [9] An Adaptive MapReduce Scheduler for Scalable Heterogeneous Systems
    Ghoneem, Mohammad
    Kulkarni, Lalit
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 603 - 611
  • [10] MapReduce Scheduler Using Classifiers for Heterogeneous Workloads
    Visalakshi, P.
    Karthik, T. U.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (04): : 68 - 73