Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

被引：0

作者：

Roy, Julien ^{[1
]}

Barde, Paul ^{[2
]}

Harvey, Felix G. ^{[1
]}

Nowrouzezahrai, Derek ^{[2
]}

Pal, Christopher ^{[1
,3
]}

机构：

[1] Polytech Montreal, Quebec AI Inst Mila, Montreal, PQ, Canada

[2] McGill Univ, Quebec AI Inst Mila, Montreal, PQ, Canada

[3] Element AI, Montreal, PQ, Canada

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide their exploration and we investigate techniques to promote such an inductive bias. We propose two policy regularization methods: TeamReg, which is based on inter-agent action predictability and CoachReg that relies on synchronized behavior selection. We evaluate each approach on four challenging continuous control tasks with sparse rewards that require varying levels of coordination as well as on the discrete action Google Research Football environment. Our experiments show improved performance across many cooperative multi-agent problems. Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.

引用

页数：12

共 50 条

[1] A multi-agent deep reinforcement learning approach for traffic signal coordination
Hu, Ta-Yin
Li, Zhuo-Yu
IET INTELLIGENT TRANSPORT SYSTEMS, 2024, 18 (08) : 1428 - 1444
[2] Distributed interference coordination based on multi-agent deep reinforcement learning
Liu T.
Luo Y.
Yang C.
Tongxin Xuebao/Journal on Communications, 2020, 41 (07): : 38 - 48
[3] Effective Multi-Agent Deep Reinforcement Learning Control With Relative Entropy Regularization
Miao, Chenyang
Cui, Yunduan
Li, Huiyun
Wu, Xinyu
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 3704 - 3718
[4] Coordination as inference in multi-agent reinforcement learning
Li, Zhiyuan
Wu, Lijun
Su, Kaile
Wu, Wei
Jing, Yulin
Wu, Tong
Duan, Weiwei
Yue, Xiaofeng
Tong, Xiyi
Han, Yizhou
NEURAL NETWORKS, 2024, 172
[5] Agent Coordination in Air Combat Simulation using Multi-Agent Deep Reinforcement Learning
Kallstrom, Johan
Heintz, Fredrik
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2157 - 2164
[6] QSOD: Hybrid Policy Gradient for Deep Multi-agent Reinforcement Learning
Rehman, Hafiz Muhammad Raza Ur
On, Byung-Won
Ningombam, Devarani Devi
Yi, Sungwon
Choi, Gyu Sang
IEEE ACCESS, 2021, 9 : 129728 - 129741
[7] Air-Ground Coordination Communication by Multi-Agent Deep Reinforcement Learning
Ding, Ruijin
Gao, Feifei
Yang, Guanghua
Shen, Xuemin Sherman
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[8] Coordination in Adversarial Multi-Agent with Deep Reinforcement Learning under Partial Observability
Diallo, Elhadji Amadou Oury
Sugawara, Toshiharu
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 198 - 205
[9] Distributed Multi-Agent Deep Reinforcement Learning for Robust Coordination against Noise
Motokawa, Yoshinari
Sugawara, Toshiharu
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[10] Multi-agent deep reinforcement learning algorithm with trend consistency regularization for portfolio management
Cong Ma
Jiangshe Zhang
Zongxin Li
Shuang Xu
Neural Computing and Applications, 2023, 35 : 6589 - 6601

← 1 2 3 4 5 →