Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

Cited by: 2
Authors
Yao, Zhiyuan [1, 2]
Ding, Zihan [3]
Clausen, Thomas [1]
Affiliations
[1] Ecole Polytech, Paris, France
[2] Cisco Syst, Paris, France
[3] Princeton Univ, Princeton, NJ USA
Keywords
MARL; load balancing; distributed systems
DOI
10.1145/3511808.3557133
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents the network load balancing problem, a challenging real-world task for multi-agent reinforcement learning (MARL) methods. Conventional heuristic solutions such as Weighted-Cost Multi-Path (WCMP) and Local Shortest Queue (LSQ) adapt poorly to changing workload distributions and arrival rates, and they balance load poorly across multiple load balancers. The cooperative network load balancing task is formulated as a decentralized partially observable Markov decision process (Dec-POMDP), which naturally leads to MARL methods. To bridge the reality gap in applying learning-based methods, all models are trained and evaluated directly on a real-world system, in moderate- to large-scale setups. Experimental evaluations show that independent and "selfish" load balancing strategies are not necessarily globally optimal, while the proposed MARL solution performs better across different realistic settings. Additionally, the potential difficulties of applying and deploying MARL methods for network load balancing are analysed, which helps draw the attention of the learning and networking communities to these challenges.
Pages: 3594-3603
Number of pages: 10