Dependency-Aware Network Adaptive Scheduling of Data-Intensive Parallel Jobs

被引:9
|
作者
Wang, Shaoqi [1 ]
Chen, Wei [1 ]
Zhou, Xiaobo [1 ]
Zhang, Liqiang [2 ]
Wang, Yin [3 ]
机构
[1] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80918 USA
[2] Indiana Univ, Dept Comp & Informat Sci, South Bend, IN 46615 USA
[3] Tongji Univ, Dept Compute Sci, Shanghai 201804, Peoples R China
基金
美国国家科学基金会;
关键词
Adaptive task scheduler; network adaptive; job dependency; data-parallel clusters; MAPREDUCE;
D O I
10.1109/TPDS.2018.2866993
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Datacenter clusters often run data-intensive jobs in parallel for improving resource utilization and cost efficiency. The performance of parallel jobs is often constrained by the cluster's hard-to-scale network bisection bandwidth. Various solutions have been proposed to address the issue, however, most of them do not consider inter-job data dependencies and schedule jobs independently from one another. In this work, we find that aggregating and co-locating the data and tasks of dependent jobs offer an extra opportunity for data locality improvement that can help to greatly enhance the performance of jobs. We propose and design Dawn, a dependency-aware network-adaptive scheduler that includes an online plan and an adaptive task scheduler. The online plan, taking job dependencies into consideration, determines where (i.e., preferred racks) to place tasks in order to proactively aggregate dependent data. The task scheduler, based on the output of online plan and dynamic network status, adaptively schedules tasks to co-locate with the dependent data in order to take advantage of data locality. We implement Dawn on Apache Yarn and evaluate it on physical and virtual clusters using various machine learning and query workloads. Results show that Dawn effectively improves cluster throughput by up to 73 and 38 percent compared to Fair Scheduler and ShuffleWatcher, respectively. Dawn not only significantly enhances the performance of jobs with dependency, but also works well for jobs without dependency.
引用
收藏
页码:515 / 529
页数:15
相关论文
共 50 条
  • [41] Parallel data-intensive algorithms and applications
    Talia, D
    Srimani, PK
    [J]. PARALLEL COMPUTING, 2002, 28 (05) : 669 - 671
  • [42] A new energy-aware task scheduling method for data-intensive applications in the cloud
    Zhao, Qing
    Xiong, Congcong
    Yu, Ce
    Zhang, Chuanlei
    Zhao, Xi
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 59 : 14 - 27
  • [43] Branch Scheduling: DAG-Aware Scheduling for Speeding up Data-Parallel Jobs
    Hu, Zhiyao
    Li, Dongsheng
    Zhang, Yiming
    Guo, Deke
    Li, Ziyang
    [J]. PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS 2019), 2019,
  • [44] Efficient Semantic-Aware Coflow Scheduling for Data-Parallel Jobs
    Li, Ziyang
    Zhang, Yiming
    Zhao, Yunxiang
    Li, Dongsheng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 154 - 155
  • [45] Dependency-Aware Vehicular Task Scheduling Policy for Tracking Service VEC Networks
    Li, Chao
    Liu, Fagui
    Wang, Bin
    Chen, C. L. Philip
    Tang, Xuhao
    Jiang, Jun
    Liu, Jie
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03): : 2400 - 2414
  • [46] Layer Dependency-Aware Learning Scheduling Algorithms for Containers in Mobile Edge Computing
    Tang, Zhiqing
    Lou, Jiong
    Jia, Weijia
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (06) : 3444 - 3459
  • [47] Dependency-Aware Task Scheduling in TrustZone Empowered Edge Clouds for Makespan Minimization
    Li, Yuepeng
    Zeng, Deze
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (03): : 423 - 434
  • [48] Privacy-Aware Data-Intensive Applications
    Guerriero, Michele
    [J]. PROCEEDINGS OF THE 2017 32ND IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE'17), 2017, : 1030 - 1033
  • [49] Contextual Dependency-aware Graph Convolutional Network for Extracting Entity Relations
    Liao, Jiahui
    Du, Yajun
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 791 - 798
  • [50] DPPACS: A Novel Data Partitioning and Placement Aware Computation Scheduling Scheme for Data-Intensive Cloud Applications
    Reddy, K. Hemant Kumar
    Roy, Diptendu Sinha
    [J]. COMPUTER JOURNAL, 2016, 59 (01): : 64 - 82