Mean Field Multi-Agent Reinforcement Learning

Cited by: 0
Authors
Yang, Yaodong [1]
Luo, Rui [1]
Li, Minne [1]
Zhou, Ming [2]
Zhang, Weinan [2]
Wang, Jun [1]
Affiliations
[1] UCL, London, England
[2] Shanghai Jiao Tong University, Shanghai, People's Republic of China
Keywords
GAMES;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing multi-agent reinforcement learning methods are typically limited to a small number of agents. When the number of agents grows large, learning becomes intractable due to the curse of dimensionality and the exponential growth of agent interactions. In this paper, we present Mean Field Reinforcement Learning, where the interactions within the population of agents are approximated by those between a single agent and the average effect of the overall population or of neighboring agents. The interplay between the two entities is mutually reinforcing: the learning of the individual agent's optimal policy depends on the dynamics of the population, while the dynamics of the population change according to the collective patterns of the individual policies. We develop practical mean field Q-learning and mean field Actor-Critic algorithms and analyze the convergence of the solution to a Nash equilibrium. Experiments on Gaussian squeeze, the Ising model, and battle games demonstrate the learning effectiveness of our mean field approaches. In addition, we report the first result on solving the Ising model with model-free reinforcement learning methods.
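The approximation described in the abstract, replacing an agent's many pairwise interactions with one interaction against an average neighbor action, can be illustrated with a small tabular sketch. This is not the authors' implementation. The helper names (`mean_action`, `boltzmann_policy`, `mf_q_update`), the dictionary-based Q-table, the Boltzmann (softmax) policy, the use of a next-step mean action, and all hyperparameter values are assumptions chosen for illustration only.

```python
import math
from collections import defaultdict


def mean_action(neighbor_actions, n_actions):
    """Average of one-hot encoded neighbor actions (hypothetical helper).

    The result is the empirical distribution of the neighbors' actions.
    """
    counts = [0.0] * n_actions
    for a in neighbor_actions:
        counts[a] += 1.0
    total = len(neighbor_actions)
    return tuple(c / total for c in counts)


def boltzmann_policy(q_values, beta):
    """Softmax over Q-values with inverse temperature beta (assumed policy form)."""
    m = max(q_values)  # subtract the max for numerical stability
    weights = [math.exp(beta * (q - m)) for q in q_values]
    z = sum(weights)
    return [w / z for w in weights]


def mf_q_update(Q, s, a, a_bar, r, s_next, a_bar_next, n_actions,
                alpha=0.1, gamma=0.9, beta=1.0):
    """One tabular update of Q(s, a, a_bar), where a_bar is the mean neighbor action.

    The next-state value is the expectation of Q under the Boltzmann policy;
    conditioning it on a_bar_next is an assumption of this sketch.
    """
    next_q = [Q[(s_next, b, a_bar_next)] for b in range(n_actions)]
    pi = boltzmann_policy(next_q, beta)
    v_next = sum(p * q for p, q in zip(pi, next_q))
    key = (s, a, a_bar)
    Q[key] = (1 - alpha) * Q[key] + alpha * (r + gamma * v_next)
    return Q[key]


# Usage: four neighbors, two possible actions, a single transition.
Q = defaultdict(float)
a_bar = mean_action([0, 1, 1, 1], n_actions=2)
new_q = mf_q_update(Q, s=0, a=1, a_bar=a_bar, r=1.0,
                    s_next=1, a_bar_next=a_bar, n_actions=2)
```

The point of the sketch is that the Q-table is indexed by the agent's own action and a single mean action rather than by the joint action of all agents, so its size does not grow exponentially with the population. Keying a table on a floating-point mean action is only workable for toy cases; a practical version would more likely use a function approximator.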
Pages: 10