Scheduling Jobs across Geo-Distributed Datacenters with Max-Min Fairness

被引:30
|
作者
Chen, Li [1 ]
Liu, Shuhao [1 ]
Li, Baochun [1 ]
Li, Bo [2 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S3G4, Canada
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Geo-distributed datacenter networks; wide-area big data analytics; scheduling; fairness;
D O I
10.1109/TNSE.2018.2795580
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
It has become routine for large volumes of data to be generated, stored, and processed across geographically distributed datacenters. To run a single data analytic job on such geo-distributed data, recent research proposed to distribute its tasks across datacenters, considering both data locality and network bandwidth across datacenters. Yet, it remains an open problem in the more general case, where multiple analytic jobs need to fairly share the resources at these geo-distributed datacenters. In this paper, we focus on the problem of assigning tasks belonging to multiple jobs across datacenters, with the specific objective of achieving max-min fairness across jobs sharing these datacenters, in terms of their job completion times. We formulate this problem as a lexicographical minimization problem, which is challenging to solve in practice due to its inherent multi-objective and discrete nature. To address these challenges, we iteratively solve its single-objective subproblems, which can be transformed to equivalent linear programming (LP) problems to be efficiently solved, thanks to their favorable properties. As a highlight of this paper, we have designed and implemented our proposed solution as a fair job scheduler based on Apache Spark, a modern data processing framework. With extensive evaluations of our real-world implementation on Amazon EC2 and large-scale simulations, we have shown convincing evidence that max-min fairness has been achieved and the worst job completion time has been significantly improved using our new job scheduler.
引用
收藏
页码:488 / 500
页数:13
相关论文
共 50 条
  • [21] Online Scaling of NFV Service Chains Across Geo-Distributed Datacenters
    Jia, Yongzheng
    Wu, Chuan
    Li, Zongpeng
    Le, Franck
    Liu, Alex
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (02) : 699 - 710
  • [22] A packet scheduling algorithm for max-min fairness in multihop wireless LANs
    Wakuda, Kensaku
    Kasahara, Shoji
    Takahashi, Yutaka
    Kure, Yoshinobu
    Itakura, Eisaburo
    [J]. COMPUTER COMMUNICATIONS, 2009, 32 (13-14) : 1437 - 1444
  • [23] Optimizing the Cost-Performance Tradeoff for Coflows Across Geo-Distributed Datacenters
    Xu, Xinping
    Li, Wenxin
    Li, Keqiu
    Qi, Heng
    Jin, Yingwei
    [J]. IEEE ACCESS, 2018, 6 : 24488 - 24497
  • [24] Cost Optimization for Time-Bounded Request Scheduling in Geo-Distributed Datacenters
    Wei, Xiaohui
    Li, Lanxin
    Wang, Xingwang
    Liu, Yuanyuan
    [J]. CLOUD COMPUTING AND SECURITY, PT II, 2017, 10603 : 601 - 610
  • [25] Priority service and max-min fairness
    Marbach, P
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2003, 11 (05) : 733 - 746
  • [26] Priority service and max-min fairness
    Marbach, P
    [J]. IEEE INFOCOM 2002: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2002, : 266 - 275
  • [27] Max-Min Processors Scheduling
    Alquhayz, Hani
    Jemmali, Mandi
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (01): : 5 - 12
  • [28] Cost-Aware Big Data Processing Across Geo-Distributed Datacenters
    Xiao, Wenhua
    Bao, Weidong
    Zhu, Xiaomin
    Liu, Ling
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (11) : 3114 - 3127
  • [29] Distributed Resource Allocation and User Association for Max-Min Fairness in HetNets
    Kim, Yeongjun
    Jang, Jonggyu
    Yang, Hyun Jong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 2983 - 2988
  • [30] A unified framework for max-min and min-max fairness with applications
    Radunovic, Bozidar
    Le Boudec, Jean-Yves
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2007, 15 (05) : 1073 - 1083