Dynamic Job Ordering and Slot Configurations for MapReduce Workloads

被引:21
|
作者
Tang, Shanjiang [1 ]
Lee, Bu-Sung [2 ]
He, Bingsheng [2 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
基金
新加坡国家研究基金会;
关键词
MapReduce; Hadoop; flow-shops; scheduling algorithm; job ordering; 2-STAGE;
D O I
10.1109/TSC.2015.2426186
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
MapReduce is a popular parallel computing paradigm for large-scale data processing in clusters and data centers. A MapReduce workload generally contains a set of jobs, each of which consists of multiple map tasks followed by multiple reduce tasks. Due to 1) that map tasks can only run in map slots and reduce tasks can only run in reduce slots, and 2) the general execution constraints that map tasks are executed before reduce tasks, different job execution orders and map/reduce slot configurations for a MapReduce workload have significantly different performance and system utilization. This paper proposes two classes of algorithms to minimize the makespan and the total completion time for an offline MapReduce workload. Our first class of algorithms focuses on the job ordering optimization for a MapReduce workload under a given map/reduce slot configuration. In contrast, our second class of algorithms considers the scenario that we can perform optimization for map/reduce slot configuration for a MapReduce workload. We perform simulations as well as experiments on Amazon EC2 and show that our proposed algorithms produce results that are up to 15 similar to 80 percent better than currently unoptimized Hadoop, leading to significant reductions in running time in practice.
引用
收藏
页码:4 / 17
页数:14
相关论文
共 50 条
  • [1] Improvement of Makespan and TCTime in Dynamic Job Ordering and Slot Utilization for MapReduce Workloads
    Nagale, Tanmayi
    Lecture Notes in Networks and Systems, 2023, 632 : 95 - 110
  • [2] MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    EURO-PAR 2013 PARALLEL PROCESSING, 2013, 8097 : 291 - 304
  • [3] Analysis of Job Scheduling Algorithms and Studying Dynamic Job Ordering to Optimize MapReduce
    Mohammed, Ahmed Qasim
    Bharati, Rajesh
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS, ICICA 2016, 2018, 632 : 343 - 352
  • [4] FLEX: A Slot Allocation Scheduling Optimizer for MapReduce Workloads
    Wolf, Joel
    Rajan, Deepak
    Hildrum, Kirsten
    Khandekar, Rohit
    Kumar, Vibhore
    Parekh, Sujay
    Wu, Kun-Lung
    Balmin, Andrey
    MIDDLEWARE 2010, 2010, 6452 : 1 - +
  • [5] A Dynamic MapReduce Scheduler for Heterogeneous Workloads
    Tian, Chao
    Zhou, Haojie
    He, Yongqiang
    Zha, Li
    2009 EIGHTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2009, : 218 - 224
  • [6] MapReduce Job Performance Tuning by Optimizing Memory Configurations
    Luo Y.-G.
    Chen X.-S.
    Yang L.
    1600, South China University of Technology (45): : 102 - 111
  • [7] Dynamic Slot Allocation Technique for MapReduce Clusters
    Tang, Shanjiang
    Lee, Bu-Sung
    He, Bingsheng
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [8] Improve Job Ordering And Slot Configuration In Bigdata
    Sadhana, R.
    Gomathi, N.
    Rabeena, S.
    Sailaja, M.
    Sangeetha, V
    2017 INTERNATIONAL CONFERENCE ON ALGORITHMS, METHODOLOGY, MODELS AND APPLICATIONS IN EMERGING TECHNOLOGIES (ICAMMAET), 2017,
  • [9] Parallel job scheduling under dynamic workloads
    Frachtenberg, E
    Feitelson, DG
    Fernandez, J
    Petrini, F
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2003, 2862 : 208 - 227
  • [10] Semantic Characterization of MapReduce Workloads
    Xu, Zhihong
    Hirzel, Martin
    Rothermel, Gregg
    2013 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2013), 2013, : 87 - +