Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

被引:0
|
作者
Roy, Julien [1 ]
Barde, Paul [2 ]
Harvey, Felix G. [1 ]
Nowrouzezahrai, Derek [2 ]
Pal, Christopher [1 ,3 ]
机构
[1] Polytech Montreal, Quebec AI Inst Mila, Montreal, PQ, Canada
[2] McGill Univ, Quebec AI Inst Mila, Montreal, PQ, Canada
[3] Element AI, Montreal, PQ, Canada
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide their exploration and we investigate techniques to promote such an inductive bias. We propose two policy regularization methods: TeamReg, which is based on inter-agent action predictability and CoachReg that relies on synchronized behavior selection. We evaluate each approach on four challenging continuous control tasks with sparse rewards that require varying levels of coordination as well as on the discrete action Google Research Football environment. Our experiments show improved performance across many cooperative multi-agent problems. Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Toward Policy Explanations for Multi-Agent Reinforcement Learning
    Boggess, Kayla
    Kraus, Sarit
    Feng, Lu
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 109 - 115
  • [32] Uncertainty modified policy for multi-agent reinforcement learning
    Zhao, Xinyu
    Liu, Jianxiang
    Wu, Faguo
    Zhang, Xiao
    Wang, Guojian
    APPLIED INTELLIGENCE, 2024, 54 (22) : 12020 - 12034
  • [33] Federated Dynamic Spectrum Access through Multi-Agent Deep Reinforcement Learning
    Song, Yifei
    Chang, Hao-Hsuan
    Liu, Lingjia
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3466 - 3471
  • [34] Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning
    Jing, Yao
    Guo, Bin
    Li, Nuo
    Ding, Yasan
    Liu, Yan
    Yu, Zhiwen
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [35] A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning
    Yi Liu
    Xiang Wu
    Yuming Bo
    Jiacun Wang
    Lifeng Ma
    IEEE/CAA Journal of Automatica Sinica, 2024, 11 (11) : 2346 - 2348
  • [36] A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning
    Liu, Yi
    Wu, Xiang
    Bo, Yuming
    Wang, Jiacun
    Ma, Lifeng
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (11) : 2346 - 2348
  • [37] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722
  • [38] Multi-Agent Deep Reinforcement Learning with Emergent Communication
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [39] Experience Selection in Multi-Agent Deep Reinforcement Learning
    Wang, Yishen
    Zhang, Zongzhang
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 864 - 870
  • [40] Sparse communication in multi-agent deep reinforcement learning
    Han, Shuai
    Dastani, Mehdi
    Wang, Shihan
    NEUROCOMPUTING, 2025, 625