On Datacenter-Network-Aware Load Balancing in MapReduce

Cited by: 2
Authors
Le, Yanfang [1]
Wang, Feng [2]
Liu, Jiangchuan [1]
Ergun, Funda [1, 3]
Affiliations
[1] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
[2] Univ Mississippi, University, MS 38677 USA
[3] Indiana Univ Bloomington, Bloomington, IN USA
DOI
10.1109/CLOUD.2015.71
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
MapReduce has emerged as a powerful tool for distributed, scalable processing of voluminous data. When the input data is skewed, load must be balanced among the MapReduce worker nodes to minimize the overall finishing time; this, however, can incur massive data movement in the datacenter network. In this paper, we examine, for the first time, the problem of datacenter-network-aware load balancing in the shuffle subphase of MapReduce. Unlike earlier studies, which generally assume that the network inside a datacenter has negligible delay and infinite capacity, we account for the traffic and bottlenecks of real datacenter networks by introducing constraints on available network bandwidth, and we demonstrate that the resulting problem can be decomposed into two subproblems: network flow and load balancing. We present effective solutions to both, which together yield a complete solution for near-optimal datacenter-network-aware load balancing. We also develop a much simpler greedy algorithm with comparable performance for fast implementation in practice. The effectiveness of our solution is demonstrated on synthetic and real public datasets.
Pages: 485-492
Page count: 8