Dependency-Aware Network Adaptive Scheduling of Data-Intensive Parallel Jobs

被引:9
|
作者
Wang, Shaoqi [1 ]
Chen, Wei [1 ]
Zhou, Xiaobo [1 ]
Zhang, Liqiang [2 ]
Wang, Yin [3 ]
机构
[1] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80918 USA
[2] Indiana Univ, Dept Comp & Informat Sci, South Bend, IN 46615 USA
[3] Tongji Univ, Dept Compute Sci, Shanghai 201804, Peoples R China
基金
美国国家科学基金会;
关键词
Adaptive task scheduler; network adaptive; job dependency; data-parallel clusters; MAPREDUCE;
D O I
10.1109/TPDS.2018.2866993
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Datacenter clusters often run data-intensive jobs in parallel for improving resource utilization and cost efficiency. The performance of parallel jobs is often constrained by the cluster's hard-to-scale network bisection bandwidth. Various solutions have been proposed to address the issue, however, most of them do not consider inter-job data dependencies and schedule jobs independently from one another. In this work, we find that aggregating and co-locating the data and tasks of dependent jobs offer an extra opportunity for data locality improvement that can help to greatly enhance the performance of jobs. We propose and design Dawn, a dependency-aware network-adaptive scheduler that includes an online plan and an adaptive task scheduler. The online plan, taking job dependencies into consideration, determines where (i.e., preferred racks) to place tasks in order to proactively aggregate dependent data. The task scheduler, based on the output of online plan and dynamic network status, adaptively schedules tasks to co-locate with the dependent data in order to take advantage of data locality. We implement Dawn on Apache Yarn and evaluate it on physical and virtual clusters using various machine learning and query workloads. Results show that Dawn effectively improves cluster throughput by up to 73 and 38 percent compared to Fair Scheduler and ShuffleWatcher, respectively. Dawn not only significantly enhances the performance of jobs with dependency, but also works well for jobs without dependency.
引用
收藏
页码:515 / 529
页数:15
相关论文
共 50 条
  • [1] Network-Adaptive Scheduling of Data-Intensive Parallel Jobs with Dependencies in Clusters
    Wang, Shaoqi
    Zhou, Xiaobo
    Zhang, Liqiang
    Jiang, Changjun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC COMPUTING (ICAC), 2017, : 155 - 160
  • [2] GRAPHENE: Packing and Dependency-aware Scheduling for Data-Parallel Clusters
    Grandl, Robert
    Kandula, Srikanth
    Rao, Sriram
    Akella, Aditya
    Kulkarni, Janardhan
    [J]. PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2016, : 81 - 97
  • [3] DAScheduler: Dependency-Aware Scheduling Algorithm for Containerized Dependent Jobs
    Alelyani, Abdullah
    Datta, Amitava
    Hassan, Ghulam Mubashar
    [J]. JOURNAL OF GRID COMPUTING, 2023, 21 (03)
  • [4] DAScheduler: Dependency-Aware Scheduling Algorithm for Containerized Dependent Jobs
    Abdullah Alelyani
    Amitava Datta
    Ghulam Mubashar Hassan
    [J]. Journal of Grid Computing, 2023, 21
  • [5] Dependency-aware and Resource-efficient Scheduling for Heterogeneous Jobs in Clouds
    Liu, Jinwei
    Shen, Haiying
    [J]. 2016 8TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2016), 2016, : 110 - 117
  • [6] Parallel Scheduling of Data-Intensive Tasks
    Meng, Xiao
    Golab, Lukasz
    [J]. EURO-PAR 2020: PARALLEL PROCESSING, 2020, 12247 : 117 - 133
  • [7] Locality and Network-Aware Reduce Task Scheduling for Data-Intensive Applications
    Arslan, Engin
    Shekhar, Mrigank
    Kosar, Tevfik
    [J]. 2014 5TH INTERNATIONAL WORKSHOP ON DATA-INTENSIVE COMPUTING IN THE CLOUDS (DATACLOUD), 2014, : 17 - 24
  • [8] Sharing-Aware InterCloud Scheduler for Data-Intensive Jobs
    Mehdi, Nawfal A.
    Holmes, Bryn
    Mamat, Ali
    Subramaniam, Shamala K.
    [J]. 2012 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES, APPLICATIONS AND MANAGEMENT (ICCCTAM), 2012, : 22 - 26
  • [9] Scheduling file transfers for data-intensive jobs on heterogeneous clusters
    Khanna, Gaurav
    Catalyurek, Umit
    Kurc, Tahsin
    Sadayappan, P.
    Saltz, Joel
    [J]. EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 214 - +
  • [10] Dependency-Aware Data Locality for MapReduce
    Fan, Xiaoyi
    Ma, Xiaoqiang
    Liu, Jiangchuan
    Li, Dan
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 409 - 416