Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

Cited by: 0

Authors:

Roy, Julien [1]

Barde, Paul [2]

Harvey, Felix G. [1]

Nowrouzezahrai, Derek [2]

Pal, Christopher [1,3]

Affiliations:

[1] Polytech Montreal, Quebec AI Inst Mila, Montreal, PQ, Canada

[2] McGill Univ, Quebec AI Inst Mila, Montreal, PQ, Canada

[3] Element AI, Montreal, PQ, Canada

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In multi-agent reinforcement learning, discovering successful collective behaviors is challenging as it requires exploring a joint action space that grows exponentially with the number of agents. While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide their exploration and we investigate techniques to promote such an inductive bias. We propose two policy regularization methods: TeamReg, which is based on inter-agent action predictability, and CoachReg, which relies on synchronized behavior selection. We evaluate each approach on four challenging continuous control tasks with sparse rewards that require varying levels of coordination, as well as on the discrete-action Google Research Football environment. Our experiments show improved performance across many cooperative multi-agent problems. Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.


Pages: 12
