Endpoint-Flexible Coflow Scheduling Across Geo-Distributed Datacenters

被引:13
|
作者
Li, Wenxin [1 ]
Yuan, Xu [2 ]
Li, Keqiu [3 ]
Qi, Heng [4 ]
Zhou, Xiaobo [3 ]
Xu, Renhai [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Univ Louisiana Lafayette, Sch Comp & Informat, Lafayette, LA 70503 USA
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin Key Lab Adv Networking TANK, Tianjin 300350, Peoples R China
[4] Dalian Univ Technol, Sch Comp Sci & Technol, 2 Linggong Rd, Dalian 116023, Peoples R China
基金
国家重点研发计划;
关键词
Task analysis; Bandwidth; Scheduling; Heuristic algorithms; Distributed databases; Approximation algorithms; Data models; Inter-datacenter; coflow scheduling; CCT; deadline; endpoint flexibility;
D O I
10.1109/TPDS.2020.2992615
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Over the last decade, we have witnessed growing data volumes generated and stored across geographically distributed datacenters. Processing such geo-distributed datasets may suffer from significant slowdown as the underlying network flows have to go through the inter-datacenter networks with relatively low and highly heterogeneous available link bandwidth. Thus, optimizing the transmissions of inter-datacenter flows, especially coflows that capture application-level semantics, is important for improving the communication performance of such geo-distributed applications. However, prior solutions on coflow scheduling have significant limitations: they schedule coflows with already-fixed endpoints of flows, making them insufficient to optimize the coflow completion time (CCT). In this article, we focus on the problem of jointly considering endpoint placement and coflow scheduling to minimize the average CCT of coflows across geo-distributed datacenters. To solve this problem without any prior knowledge of coflow arrivals, we present a coflow-aware optimization framework called SmartCoflow. In SmartCoflow, we first apply an approximate algorithm to obtain the endpoint placement and scheduling decisions for a single coflow. Based on the single-coflow solution, we then develop an efficient online algorithm to handle the dynamically arrived coflows. Through rigorous theoretical analysis, we prove that SmartCoflow has a non-trivial competitive ratio. We also extend SmartCoflow to incorporate various design choices or requirements of applications and operators, such as enforcing an inter-datacenter bandwidth usage budget and considering coflow deadline. Through experimental results from testbed implementation and trace-driven simulations, we demonstrate that SmartCoflow can reduce the average CCT, lower bandwidth usage, and improve coflow deadline meet rate, when compared to the state-of-the-art scheduling-only method.
引用
收藏
页码:2466 / 2481
页数:16
相关论文
共 50 条
  • [1] Leveraging Endpoint Flexibility When Scheduling Coflows across Geo-distributed Datacenters
    Li, Wenxin
    Yuan, Xu
    Li, Keqiu
    Qi, Heng
    Zhou, Xiaobo
    [J]. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2018), 2018, : 873 - 881
  • [2] Joint Online Coflow Optimization Across Geo-Distributed Datacenters
    Wu, Zhaoxi
    [J]. IEEE ACCESS, 2020, 8 : 213602 - 213610
  • [3] Scheduling Jobs Across Geo-distributed Datacenters
    Hung, Chien-Chun
    Golubchik, Leana
    Yu, Minlan
    [J]. ACM SOCC'15: PROCEEDINGS OF THE SIXTH ACM SYMPOSIUM ON CLOUD COMPUTING, 2015, : 111 - 124
  • [4] Flutter: Scheduling Tasks Closer to Data Across Geo-Distributed Datacenters
    Hu, Zhiming
    Li, Baochun
    Luo, Jun
    [J]. IEEE INFOCOM 2016 - THE 35TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS, 2016,
  • [5] Scheduling Jobs across Geo-Distributed Datacenters with Max-Min Fairness
    Chen, Li
    Liu, Shuhao
    Li, Baochun
    Li, Bo
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2019, 6 (03): : 488 - 500
  • [6] Scheduling Jobs across Geo-Distributed Datacenters with Max-Min Fairness
    Chen, Li
    Liu, Shuhao
    Li, Baochun
    Li, Bo
    [J]. IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,
  • [7] MAST: Global Scheduling of ML Training across Geo-Distributed Datacenters at Hyperscale
    Choudhury, Arnab
    Wang, Yang
    Pelkonen, Tuomas
    Srinivasan, Kutta
    Jain, Abha
    Lin, Shenghao
    David, Delia
    Soleimanifard, Siavash
    Chen, Michael
    Yadav, Abhishek
    Tijoriwala, Ritesh
    Samoylov, Denis
    Tang, Chunqiang
    [J]. PROCEEDINGS OF THE 18TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2024, 2024, : 563 - 580
  • [8] Calantha: Content Distribution across Geo-Distributed Datacenters
    Li, Yangyang
    Zhang, Linchao
    Jia, Yue
    Liao, Yong
    Xie, Haiyong
    [J]. 2017 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2017, : 724 - 729
  • [9] Orchestrating Bulk Data Transfers across Geo-Distributed Datacenters
    Wu, Yu
    Zhang, Zhizhong
    Wu, Chuan
    Guo, Chuanxiong
    Li, Zongpeng
    Lau, Francis C. M.
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2017, 5 (01) : 112 - 125
  • [10] On efficient virtual cluster scaling across geo-distributed datacenters
    Xu, Xinping
    Li, Wenxin
    Qi, Heng
    Li, Keqiu
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (10):