Adaptive mean field multi-agent reinforcement learning

被引:3
|
作者
Wang, Xiaoqiang [1 ]
Ke, Liangjun [1 ]
Zhang, Gewei [1 ]
Zhu, Dapeng [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Large scale; Multi-agent reinforcement learning; Adaptive mean field approximation;
D O I
10.1016/j.ins.2024.120560
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale Multi -Agent Reinforcement Learning (MARL) is fundamentally a challenge due to the curse of dimensionality. In a homogeneous multi -agent setting, mean field theory gives an effective way of scalable MARL by abstracting other agents to a virtual mean agent, assuming that the influence between agents is equal and infinitesimal. However, in some real scenarios, only several neighboring agents, rather than all agents, affect the decision -making of an agent, and different neighboring agents may have varying degrees of influence on the agent's decisionmaking. In this paper, not restricted to a homogeneous setting, we propose adaptive mean field MARL, which is based on the attention mechanism and can be used to deal with many -agent scenarios where there may be different influence relationships among agents. Specifically, we first derive the mean field approximation with adaptive weight and give the error bound of the approximation. Then, we propose adaptive mean field Q -Learning and describe how to obtain the adaptive weight. In addition, we discuss the differences between the proposed approach and existing mean -field MARL methods. Finally, we conduct experiments on simulation platforms, and the results show that the performance of the proposed approach outperforms that of the state-of-the-art method.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Mean Field Multi-Agent Reinforcement Learning
    Yang, Yaodong
    Luo, Rui
    Li, Minne
    Zhou, Ming
    Zhang, Weinan
    Wang, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] Causal Mean Field Multi-Agent Reinforcement Learning
    Ma, Hao
    Pu, Zhiqiang
    Pan, Yi
    Liu, Boyin
    Gao, Junlong
    Guo, Zhenyu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [3] Multi-agent reinforcement learning with adaptive mimetism
    Yamaguchi, T
    Miura, M
    Yachida, M
    ETFA '96 - 1996 IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, PROCEEDINGS, VOLS 1 AND 2, 1996, : 288 - 294
  • [4] Graphon mean-field control for cooperative multi-agent reinforcement learning
    Hu, Yuanquan
    Wei, Xiaoli
    Yan, Junji
    Zhang, Hengxi
    JOURNAL OF THE FRANKLIN INSTITUTE, 2023, 360 (18) : 14783 - 14805
  • [5] Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
    Li, Minne
    Qin, Zhiwei
    Jiao, Yan
    Yang, Yaodong
    Gong, Zhichen
    Wang, Jun
    Wang, Chenxi
    Wu, Guobin
    Ye, Jieping
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 983 - 994
  • [6] Adaptive Average Exploration in Multi-Agent Reinforcement Learning
    Hall, Garrett
    Holladay, Ken
    2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
  • [7] ADAPTIVE STATE REPRESENTATIONS FOR MULTI-AGENT REINFORCEMENT LEARNING
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 181 - 189
  • [8] Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach
    Lu, Yanqing
    Zhang, Meng
    Tang, Ming
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 332 - 337
  • [9] Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
    Mondal, Washim Uddin
    Aggarwal, Vaneet
    Ukkusuri, Satish, V
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [10] Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control
    Zhang, Zundong
    Zhang, Wei
    Liu, Yuke
    Xiong, Gang
    ELECTRONICS, 2023, 12 (22)