Multi-agent Actor-Critic Reinforcement Learning Based In-network Load Balance

被引:14
|
作者
Mai, Tianle [1 ]
Yao, Haipeng [1 ]
Xiong, Zehui [2 ]
Guo, Song [3 ]
Niyato, Dusit Tao [2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
关键词
Load balance; Multi-agent system; Actor-critic;
D O I
10.1109/GLOBECOM42002.2020.9322277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Load balancing is a difficult online decision-making problem in the current network. Recently, with the development of the programmable data-plane, it is feasible to perform flexibly load balance directly inside the network. This in-network load balance scheme can quickly adapt to the volatility of network traffic. However, previous in-network solutions are largely relying on the manual process. Inspired by recent successes in applying machine learning in online control, automating the in-network load balance process is thus appealing. But as a distributed control system, it behooves us to ask the critical question: "Can the distributed switches learn globally optimal scheduling policy and still be deployed in a distributed fashion to allow rapid reaction in real-time?" To tackle this question, we adopt a centralized learning and distributed execution framework and propose a multi-agent actor-critic reinforcement learning algorithm in this paper. The centralized "critic" is reinforced with the global network state and joint actions of all agents to ease the training process whilst distributed switches can take actions relaying on their local observations. In addition, a baseline scheme is introduced to solve the credit assignment problem in the multi-agent system. The extensive simulations are conducted to evaluate our proposed algorithm in comparison to state-of-the-art schemes.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning
    Diddigi, Raghuram Bharadwaj
    Reddy, D. Sai Koti
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1931 - 1933
  • [2] Multi-Agent Natural Actor-Critic Reinforcement Learning Algorithms
    Prashant Trivedi
    Nandyala Hemachandra
    [J]. Dynamic Games and Applications, 2023, 13 : 25 - 55
  • [3] Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
    Christianos, Filippos
    Schafer, Lukas
    Albrecht, Stefano V.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method
    Heredia, Paulo C.
    Mou, Shaoshuai
    [J]. IFAC PAPERSONLINE, 2019, 52 (20): : 363 - 368
  • [5] A multi-agent reinforcement learning using Actor-Critic methods
    Li, Chun-Gui
    Wang, Meng
    Yuan, Qing-Neng
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 878 - 882
  • [6] Multi-Agent Natural Actor-Critic Reinforcement Learning Algorithms
    Trivedi, Prashant
    Hemachandra, Nandyala
    [J]. DYNAMIC GAMES AND APPLICATIONS, 2023, 13 (01) : 25 - 55
  • [7] Actor-Critic for Multi-Agent Reinforcement Learning with Self-Attention
    Zhao, Juan
    Zhu, Tong
    Xiao, Shuo
    Gao, Zongqian
    Sun, Hao
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (09)
  • [8] Multi-agent reinforcement learning by the actor-critic model with an attention interface
    Zhang, Lixiang
    Li, Jingchen
    Zhu, Yi'an
    Shi, Haobin
    Hwang, Kao-Shing
    [J]. NEUROCOMPUTING, 2022, 471 : 275 - 284
  • [9] Structural relational inference actor-critic for multi-agent reinforcement learning
    Zhang, Xianjie
    Liu, Yu
    Xu, Xiujuan
    Huang, Qiong
    Mao, Hangyu
    Carie, Anil
    [J]. NEUROCOMPUTING, 2021, 459 : 383 - 394
  • [10] Dynamic Spectrum Sharing Based on Federated Learning and Multi-Agent Actor-Critic Reinforcement Learning
    Yang, Tongtong
    Zhang, Wensheng
    Bo, Yulian
    Sun, Jian
    Wang, Cheng-Xiang
    [J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 947 - 952