Adaptive mean field multi-agent reinforcement learning

被引:1
|
作者
Wang, Xiaoqiang [1 ]
Ke, Liangjun [1 ]
Zhang, Gewei [1 ]
Zhu, Dapeng [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Large scale; Multi-agent reinforcement learning; Adaptive mean field approximation;
D O I
10.1016/j.ins.2024.120560
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale Multi -Agent Reinforcement Learning (MARL) is fundamentally a challenge due to the curse of dimensionality. In a homogeneous multi -agent setting, mean field theory gives an effective way of scalable MARL by abstracting other agents to a virtual mean agent, assuming that the influence between agents is equal and infinitesimal. However, in some real scenarios, only several neighboring agents, rather than all agents, affect the decision -making of an agent, and different neighboring agents may have varying degrees of influence on the agent's decisionmaking. In this paper, not restricted to a homogeneous setting, we propose adaptive mean field MARL, which is based on the attention mechanism and can be used to deal with many -agent scenarios where there may be different influence relationships among agents. Specifically, we first derive the mean field approximation with adaptive weight and give the error bound of the approximation. Then, we propose adaptive mean field Q -Learning and describe how to obtain the adaptive weight. In addition, we discuss the differences between the proposed approach and existing mean -field MARL methods. Finally, we conduct experiments on simulation platforms, and the results show that the performance of the proposed approach outperforms that of the state-of-the-art method.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Partitioning in multi-agent reinforcement learning
    Sun, R
    Peterson, T
    [J]. FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
  • [32] Multi-Agent Reinforcement Learning for Microgrids
    Dimeas, A. L.
    Hatziargyriou, N. D.
    [J]. IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [33] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [34] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    [J]. 2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [35] The Dynamics of Multi-Agent Reinforcement Learning
    Dickens, Luke
    Broda, Krysia
    Russo, Alessandra
    [J]. ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
  • [36] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    [J]. 2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
  • [37] Heterogeneous Mean-Field Multi-Agent Reinforcement Learning for Communication Routing Selection in SAGI-Net
    Zhang, Hengxi
    Tang, Huaze
    Hu, Yuanquan
    Wei, Xiaoli
    Wu, Chenye
    Ding, Wenbo
    Zhang, Xiao-Ping
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [38] Autonomous Swarm Robot Coordination via Mean-Field Control Embedding Multi-Agent Reinforcement Learning
    Tang, Huaze
    Zhang, Hengxi
    Shi, Zhenpeng
    Chen, Xinlei
    Ding, Wenbo
    Zhang, Xiao-Ping
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 8820 - 8826
  • [39] GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning
    Hao, Qianyue
    Huang, Wenzhen
    Feng, Tao
    Yuan, Jian
    Li, Yong
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 685 - 697
  • [40] Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph Attention Network for UAV Swarms
    Yang, Min
    Liu, Guanjun
    Zhou, Ziyuan
    Wang, Jiacun
    [J]. DRONES, 2023, 7 (07)