Causal Mean Field Multi-Agent Reinforcement Learning

Authors
Ma, Hao [2, 3]
Pu, Zhiqiang [1, 3]
Pan, Yi [3]
Liu, Boyin [1, 3]
Gao, Junlong [4]
Guo, Zhenyu [4]
Affiliations
[1] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Nanjing, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1109/IJCNN54540.2023.10191654
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Scalability remains an open challenge in multi-agent reinforcement learning and is an active area of research. Mean-field reinforcement learning (MFRL) alleviates the problem by using mean-field theory to reduce a many-agent problem to a two-agent problem. However, MFRL cannot identify which interactions are essential in nonstationary environments. The causal mechanisms behind interactions remain relatively invariant even when the environment is nonstationary. We therefore propose an algorithm called causal mean-field Q-learning (CMFQ) to address the scalability problem. CMFQ inherits MFRL's compressed representation of the action-state space while being more robust to changes in the number of agents. First, we model the causality behind the decision-making process of MFRL as a structural causal model (SCM). Then we quantify how essential each interaction is by intervening on the SCM. Furthermore, we design a causality-aware compact representation of the agents' behavioral information: a weighted sum of all behavioral information, with weights given by the causal effects. We test CMFQ in a mixed cooperative-competitive game and in a cooperative game. The results show that our method scales well both when training in environments with a large number of agents and when testing in environments with many more agents.
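The abstract's core idea, weighting each neighbor's behavior by its causal effect instead of averaging uniformly, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the causal effect of a neighbor is measured as the KL divergence between the central agent's policy before and after intervening on that neighbor's action, averaged over all alternative actions. The names `q_fn`, `causal_weights`, and `mean_action` are hypothetical, and `q_fn(state, action, mean)` stands in for a learned mean-field Q-function.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(max(pi, eps) / max(qi, eps)) for pi, qi in zip(p, q))


def mean_action(actions, n_actions, weights=None):
    """Weighted mean of one-hot neighbor actions (uniform weights by default)."""
    k = len(actions)
    if weights is None:
        weights = [1.0 / k] * k
    mean = [0.0] * n_actions
    for w, a in zip(weights, actions):
        mean[a] += w
    return mean


def policy(q_fn, state, mean, n_actions):
    """Boltzmann policy of the central agent given a mean action (assumption)."""
    return softmax([q_fn(state, a, mean) for a in range(n_actions)])


def causal_weights(q_fn, state, actions, n_actions):
    """Normalized effect of each neighbor, estimated by intervening on its action."""
    if n_actions < 2:
        raise ValueError("need at least two actions to intervene")
    base = policy(q_fn, state, mean_action(actions, n_actions), n_actions)
    effects = []
    for i, observed in enumerate(actions):
        total = 0.0
        for alt in range(n_actions):
            if alt == observed:
                continue
            # Intervention: force neighbor i to take `alt`, keep the others fixed.
            intervened = list(actions)
            intervened[i] = alt
            new = policy(q_fn, state, mean_action(intervened, n_actions), n_actions)
            total += kl_divergence(base, new)
        effects.append(total / (n_actions - 1))
    norm = sum(effects)
    if norm <= 0.0:
        # No neighbor changes the policy: fall back to the uniform mean field.
        return [1.0 / len(actions)] * len(actions)
    return [e / norm for e in effects]
```

A causality-aware mean action would then be `mean_action(actions, n_actions, causal_weights(q_fn, state, actions, n_actions))`. In this toy version the Q-function sees only the mean action, so neighbors with the same observed action receive the same weight. A Q-function that also sees per-neighbor state could distinguish between them.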
Pages: 8