A Dynamic MapReduce Scheduler for Heterogeneous Workloads

被引:74
|
作者
Tian, Chao [1 ]
Zhou, Haojie [1 ]
He, Yongqiang [1 ]
Zha, Li [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
关键词
MapReduce; Schdule; heterogeneous workloads; MODELS;
D O I
10.1109/GCC.2009.19
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Map Reduce is an important programming model for building data centers containing ten of thousands of nodes. In a practical data center of that scale, it is a common case that I/O-bound jobs and CPU-bound jobs, which demand different resources, run simultaneously in the same cluster. In the MapReduce framework, parallelization of these two kinds of job has not been concerned. In this paper, we give a new view of the MapReduce model, and classify the MapReduce workloads into three categories based on their CPU and I/O utilization. With workload classification, we design a new dynamic MapReduce workload predict mechanism, MR-Predict, which detects the workload type on the fly. We propose a Triple-Queue Scheduler based on the MR-Predict mechanism. The Triple-Queue scheduler could improve the usage of both CPU and disk I/O resources under heterogeneous workloads. And it could improve the Hadoop throughput by about 30% under heterogeneous workloads.
引用
收藏
页码:218 / 224
页数:7
相关论文
共 50 条
  • [41] Workload characteristic oriented scheduler for MapReduce
    Lu, Peng
    Lee, Young Choon
    Wang, Chen
    Zhou, Bing Bing
    Chen, Junliang
    Zomaya, Albert Y.
    [J]. Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS, 2012, : 156 - 163
  • [42] MapReduce Scheduler by Characterizing Performance Interference
    Lei Yang
    Yu Dai
    Bin Zhang
    [J]. China Communications, 2016, 13 (10) : 253 - 262
  • [43] OFScheduler: A Dynamic Network Optimizer for MapReduce in Heterogeneous Cluster
    Li, Zhao
    Shen, Yao
    Yao, Bin
    Guo, Minyi
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2015, 43 (03) : 472 - 488
  • [44] OFScheduler: A Dynamic Network Optimizer for MapReduce in Heterogeneous Cluster
    Zhao Li
    Yao Shen
    Bin Yao
    Minyi Guo
    [J]. International Journal of Parallel Programming, 2015, 43 : 472 - 488
  • [45] Towards a Realistic Scheduler for Mixed Workloads with Workflows
    Ilyushkin, Alexey
    Epema, Dick
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 753 - 756
  • [46] Pareto-based Scheduling of MapReduce Workloads
    Zacheilas, Nikos
    Kalogeraki, Vana
    [J]. 2016 IEEE 19TH INTERNATIONAL SYMPOSIUM ON REAL-TIME DISTRIBUTED COMPUTING (ISORC 2016), 2016, : 174 - 181
  • [47] Decoupled Strategy for Imbalanced Workloads in MapReduce Frameworks
    Rivas-Gomez, Sergio
    Narasimhamurthy, Sai
    Brabazon, Keeran
    Perks, Oliver
    Laure, Erwin
    Markidis, Stefano
    [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 921 - 927
  • [48] Performance Modelling and Analysis of MapReduce/Hadoop Workloads
    Yu, Xiaolong
    Li, Wei
    [J]. 2015 IEEE 21ST INTERNATIONAL WORKSHOP ON LOCAL & METROPOLITAN AREA NETWORKS (LANMAN), 2015,
  • [49] Performance Analysis of Coupling Scheduler for MapReduce/Hadoop
    Tan, Jian
    Meng, Xiaoqiao
    Zhang, Li
    [J]. 2012 PROCEEDINGS IEEE INFOCOM, 2012, : 2586 - 2590
  • [50] Low Latency and High Data Rate (LLHD) Scheduler: A Multipath TCP Scheduler for Dynamic and Heterogeneous Networks
    Lubna, Tabassum
    Mahmud, Imtiaz
    Cho, You-Ze
    [J]. SENSORS, 2022, 22 (24)