A Dynamic MapReduce Scheduler for Heterogeneous Workloads

被引:74
|
作者
Tian, Chao [1 ]
Zhou, Haojie [1 ]
He, Yongqiang [1 ]
Zha, Li [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
关键词
MapReduce; Schdule; heterogeneous workloads; MODELS;
D O I
10.1109/GCC.2009.19
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Map Reduce is an important programming model for building data centers containing ten of thousands of nodes. In a practical data center of that scale, it is a common case that I/O-bound jobs and CPU-bound jobs, which demand different resources, run simultaneously in the same cluster. In the MapReduce framework, parallelization of these two kinds of job has not been concerned. In this paper, we give a new view of the MapReduce model, and classify the MapReduce workloads into three categories based on their CPU and I/O utilization. With workload classification, we design a new dynamic MapReduce workload predict mechanism, MR-Predict, which detects the workload type on the fly. We propose a Triple-Queue Scheduler based on the MR-Predict mechanism. The Triple-Queue scheduler could improve the usage of both CPU and disk I/O resources under heterogeneous workloads. And it could improve the Hadoop throughput by about 30% under heterogeneous workloads.
引用
收藏
页码:218 / 224
页数:7
相关论文
共 50 条
  • [1] Improving MapReduce scheduler for heterogeneous workloads in a heterogeneous environment
    Jeyaraj, Rathinaraja
    Ananthanarayana, V. S.
    Paul, Anand
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [2] Improving MapReduce scheduler for heterogeneous workloads in a heterogeneous environment
    Jeyaraj, Rathinaraja
    Ananthanarayana, V. S.
    Paul, Anand
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (17):
  • [3] MapReduce Scheduler Using Classifiers for Heterogeneous Workloads
    Visalakshi, P.
    Karthik, T. U.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (04): : 68 - 73
  • [4] POSUM: A Portfolio Scheduler for MapReduce Workloads
    Voinea, Maria A.
    Uta, Alexandru
    Iosup, Alexandru
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 351 - 357
  • [5] Enhancement of Xen's Scheduler for MapReduce Workloads
    Kang, Hui
    Chen, Yao
    Wong, Jennifer L.
    Sion, Radu
    Wu, Jason
    [J]. HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 251 - 262
  • [6] A Hardware-based HEFT Scheduler Implementation for Dynamic Workloads on Heterogeneous SoCs
    Fusco, Alexander
    Hassan, Sahil
    Mack, Joshua
    Akoglu, Ali
    [J]. PROCEEDINGS OF THE 2022 IFIP/IEEE 30TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2022,
  • [7] Design Dynamic Data Allocation Scheduler to Improve MapReduce Performance in Heterogeneous Clouds
    Yang, Shin-Jer
    Chen, Yi-Ru
    Hsieh, Yung-Ming
    [J]. 2012 NINTH IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE), 2012, : 265 - 270
  • [8] Job Classification for MapReduce Scheduler in Heterogeneous Environment
    Deshmukh, Shyam
    Aghav, J. V.
    Chakravarthy, Rohan
    [J]. 2013 INTERNATIONAL CONFERENCE ON CLOUD & UBIQUITOUS COMPUTING & EMERGING TECHNOLOGIES (CUBE 2013), 2013, : 26 - +
  • [9] An Adaptive MapReduce Scheduler for Scalable Heterogeneous Systems
    Ghoneem, Mohammad
    Kulkarni, Lalit
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 603 - 611
  • [10] Energy-Aware Heuristic Scheduling Using Bin Packing MapReduce Scheduler for Heterogeneous Workloads Performance in Big Data
    S. Aarthee
    R. Prabakaran
    [J]. Arabian Journal for Science and Engineering, 2023, 48 : 1891 - 1905