Graphon mean-field control for cooperative multi-agent reinforcement learning

被引:2
|
作者
Hu, Yuanquan [1 ]
Wei, Xiaoli [1 ]
Yan, Junji [1 ]
Zhang, Hengxi [1 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst China, Shenzhen, Peoples R China
关键词
CONVERGENCE; MODEL; MARL;
D O I
10.1016/j.jfranklin.2023.09.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The marriage between mean-field theory and reinforcement learning has shown a great capacity to solve large-scale control problems with ho-mogeneous agents. To break the homogeneity restriction of mean-field theory, a recent interest is to introduce graphon theory to the mean-field paradigm. In this paper, we propose a graphon mean-field control (GMFC) framework to approximate cooperative heterogeneous multi-agent reinforcement learning (MARL) with nonuniform interactions and heterogeneous reward functions and state transition functions among agents )and show that the approximate order is of O(1 root, with N the number of agents. By discretizing the graphon index of GMFC, we further introduce a N smaller class of GMFC called block GMFC, which is shown to well approximate cooperative MARL in terms of the value function and the policy. Finally, we design a Proximal Policy Optimization based algorithm for block GMFC that converges to the optimal policy of cooperative MARL. Our empirical studies on several examples demonstrate that our GMFC approach is comparable with the state-of-art MARL algorithms while enjoying better scalability.
引用
收藏
页码:14783 / 14805
页数:23
相关论文
共 50 条
  • [1] Mean Field Multi-Agent Reinforcement Learning
    Yang, Yaodong
    Luo, Rui
    Li, Minne
    Zhou, Ming
    Zhang, Weinan
    Wang, Jun
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] Autonomous Swarm Robot Coordination via Mean-Field Control Embedding Multi-Agent Reinforcement Learning
    Tang, Huaze
    Zhang, Hengxi
    Shi, Zhenpeng
    Chen, Xinlei
    Ding, Wenbo
    Zhang, Xiao-Ping
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 8820 - 8826
  • [3] On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
    Mondal, Washim Uddin
    Aggarwal, Vaneet
    Ukkusuri, Satish, V
    Agarwal, Mridul
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [4] On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
    Mondal, Washim Uddin
    Agarwal, Mridul
    Aggarwal, Vaneet
    Ukkusuri, Satish V.
    [J]. Journal of Machine Learning Research, 2022, 23
  • [5] Weighted Mean-Field Multi-Agent Reinforcement Learning via Reward Attribution Decomposition
    Wu, Tingyu
    Li, Wenhao
    Jin, Bo
    Zhang, Wei
    Wang, Xiangfeng
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 301 - 316
  • [6] Adaptive mean field multi-agent reinforcement learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Zhang, Gewei
    Zhu, Dapeng
    [J]. INFORMATION SCIENCES, 2024, 669
  • [7] Causal Mean Field Multi-Agent Reinforcement Learning
    Ma, Hao
    Pu, Zhiqiang
    Pan, Yi
    Liu, Boyin
    Gao, Junlong
    Guo, Zhenyu
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Mean-Field Multi-Agent Reinforcement Learning for Peer-to-Peer Multi-Energy Trading
    Qiu, Dawei
    Wang, Jianhong
    Dong, Zihang
    Wang, Yi
    Strbac, Goran
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (05) : 4853 - 4866
  • [9] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [10] Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control
    Peake, Ashley
    McCalmon, Joe
    Raiford, Benjamin
    Liu, Tongtong
    Alqahtani, Sarra
    [J]. 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 15 - 22