Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

被引:0
|
作者
Roy, Julien [1 ]
Barde, Paul [2 ]
Harvey, Felix G. [1 ]
Nowrouzezahrai, Derek [2 ]
Pal, Christopher [1 ,3 ]
机构
[1] Polytech Montreal, Quebec AI Inst Mila, Montreal, PQ, Canada
[2] McGill Univ, Quebec AI Inst Mila, Montreal, PQ, Canada
[3] Element AI, Montreal, PQ, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide their exploration and we investigate techniques to promote such an inductive bias. We propose two policy regularization methods: TeamReg, which is based on inter-agent action predictability and CoachReg that relies on synchronized behavior selection. We evaluate each approach on four challenging continuous control tasks with sparse rewards that require varying levels of coordination as well as on the discrete action Google Research Football environment. Our experiments show improved performance across many cooperative multi-agent problems. Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A multi-agent deep reinforcement learning approach for traffic signal coordination
    Hu, Ta-Yin
    Li, Zhuo-Yu
    IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
  • [2] Distributed interference coordination based on multi-agent deep reinforcement learning
    Liu T.
    Luo Y.
    Yang C.
    Tongxin Xuebao/Journal on Communications, 2020, 41 (07): : 38 - 48
  • [3] Effective Multi-Agent Deep Reinforcement Learning Control With Relative Entropy Regularization
    Miao, Chenyang
    Cui, Yunduan
    Li, Huiyun
    Wu, Xinyu
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 3704 - 3718
  • [4] Coordination as inference in multi-agent reinforcement learning
    Li, Zhiyuan
    Wu, Lijun
    Su, Kaile
    Wu, Wei
    Jing, Yulin
    Wu, Tong
    Duan, Weiwei
    Yue, Xiaofeng
    Tong, Xiyi
    Han, Yizhou
    NEURAL NETWORKS, 2024, 172
  • [5] Agent Coordination in Air Combat Simulation using Multi-Agent Deep Reinforcement Learning
    Kallstrom, Johan
    Heintz, Fredrik
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2157 - 2164
  • [6] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
    Rehman, Hafiz Muhammad Raza Ur
    On, Byung-Won
    Ningombam, Devarani Devi
    Yi, Sungwon
    Choi, Gyu Sang
    IEEE ACCESS, 2021, 9 : 129728 - 129741
  • [7] Air-Ground Coordination Communication by Multi-Agent Deep Reinforcement Learning
    Ding, Ruijin
    Gao, Feifei
    Yang, Guanghua
    Shen, Xuemin Sherman
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [8] Coordination in Adversarial Multi-Agent with Deep Reinforcement Learning under Partial Observability
    Diallo, Elhadji Amadou Oury
    Sugawara, Toshiharu
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 198 - 205
  • [9] Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
    Motokawa, Yoshinari
    Sugawara, Toshiharu
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Multi-agent deep reinforcement learning algorithm with trend consistency regularization for portfolio management
    Cong Ma
    Jiangshe Zhang
    Zongxin Li
    Shuang Xu
    Neural Computing and Applications, 2023, 35 : 6589 - 6601