Swallow: Joint Online Scheduling and Coflow Compression in Datacenter Networks

被引:10
|
作者
Zhou, Qihua [1 ]
Li, Peng [2 ]
Wang, Kun [1 ]
Zeng, Deze [3 ]
Guo, Song [4 ]
Guo, Minyi [5 ]
机构
[1] Nanjing Univ Posts & Telecommun, Nanjing, Jiangsu, Peoples R China
[2] Univ Aizu, Aizu Wakamatsu, Fukushima, Japan
[3] China Univ Geosci, Wuhan, Hubei, Peoples R China
[4] Hong Kong Polytech Univ, Hong Kong, Hong Kong, Peoples R China
[5] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
基金
中国博士后科学基金;
关键词
Big Data; Coflow Scheduling; Traffic Compression; Datacenter Networks;
D O I
10.1109/IPDPS.2018.00060
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Big data analytics in datacenters often involves scheduling of data-parallel job, which are bottlenecked by limited bandwidth of datacenter networks. To alleviate the shortage of bandwidth, some existing work has proposed traffic compression to reduce the amount of data transmitted over the network. However, their proposed traffic compression works in a coarse-grained manner at job level, leaving a large optimization space unexplored for further performance improvement. In this paper, we propose a flow-level traffic compression and scheduling system, called Swallow, to accelerate data-intensive applications. Specifically, we target on coflows, which is an elegant abstraction of parallel flows generated by big data jobs. With the objective of minimizing coflow completion time (CCT), we propose a heuristic algorithm called Fastest-Volume-Disposal-First (FVDV) and implement Swallow based on Spark. The results of both trace-driven simulations and real experiments show the superiority of our system, over existing algorithms. Swallow can reduce CCT and job completion time (JCT) by up to 1.47x and 1.66x on average, respectively, over the SEBF in Varys, one of the most efficient coflow scheduling algorithms so far. Moreover, with coflow compression, Swallow reduces data traffic by up to 48.41% on average.
引用
收藏
页码:505 / 514
页数:10
相关论文
共 50 条
  • [21] A Survey of Coflow Scheduling Schemes for Data Center Networks
    Wang, Shuo
    Zhang, Jiao
    Huang, Tao
    Liu, Jiang
    Pan, Tian
    Liu, Yunjie
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (06) : 179 - 185
  • [22] Weaver: Efficient Coflow Scheduling in Heterogeneous Parallel Networks
    Huang, Xin Sunny
    Xia, Yiting
    Ng, T. S. Eugene
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 1071 - 1081
  • [23] Joint Reducer Placement and Coflow Bandwidth Scheduling for Computing Clusters
    Zhao, Yangming
    Tian, Chen
    Fan, Jingyuan
    Guan, Tong
    Zhang, Xiaoning
    Qiao, Chunming
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (01) : 438 - 451
  • [24] RCD: Rapid Close to Deadline Scheduling for Datacenter Networks
    Noormohammadpour, Mohammad
    Raghavendra, Cauligi S.
    Rao, Sriram
    Madni, Asad M.
    2016 WORLD AUTOMATION CONGRESS (WAC), 2016,
  • [25] HybridPass: Hybrid Scheduling for Mixed Flows in Datacenter Networks
    Peng, Bo
    Yao, Jianguo
    Qi, Zhengwei
    Guan, Haibing
    2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 1000 - 1009
  • [26] Efficient Online Scheduling for Coflow-Aware Machine Learning Clusters
    Li, Wenxin
    Chen, Sheng
    Li, Keqiu
    Qi, Heng
    Xu, Renhai
    Zhang, Song
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (04) : 2564 - 2579
  • [27] Joint coflow routing and scheduling in leaf-spine data centers
    Chen, Yang
    Wu, Jie
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 148 : 83 - 95
  • [28] Approximate Multicast Coflow Scheduling in Reconfigurable Data Center Networks
    Wu, Yuhang
    Chen, Quan
    Liu, Jianglong
    Li, Fulong
    Cheng, Lianglun
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT III, 2024, 14489 : 139 - 154
  • [29] Coflow Scheduling With Unknown Prior Information in Data Center Networks
    Wei, Zhe
    Guo, Songtao
    Liu, Guiyan
    Yang, Yuanyuan
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [30] Joint Online Coflow Optimization Across Geo-Distributed Datacenters
    Wu, Zhaoxi
    IEEE ACCESS, 2020, 8 : 213602 - 213610