Multi-Agent Incentive Communication via Decentralized Teammate Modeling

Cited by: 0

Authors:

Yuan, Lei [1,3]

Wang, Jianhao [2]

Zhang, Fuxiang [1]

Wang, Chenghe [1]

Zhang, Zongzhang [1]

Yu, Yang [1,3,4]

Zhang, Chongjie [2]

Affiliations:

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China

[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China

[3] Polixir Technol, Nanjing 210000, Peoples R China

[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China

Source:

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Effective communication can improve coordination in cooperative multi-agent reinforcement learning (MARL). One popular communication scheme is to exchange agents' local observations or latent embeddings and use them to augment each agent's local policy input. Such a communication paradigm can reduce uncertainty in local decision-making and induce implicit coordination. However, it enlarges agents' local policy spaces and increases learning complexity, leading to poor coordination in complex settings. To address this limitation, this paper proposes a novel framework named Multi-Agent Incentive Communication (MAIC) that allows each agent to learn to generate incentive messages and bias other agents' value functions directly, resulting in effective explicit coordination. The method first learns targeted teammate models, with which each agent can anticipate a teammate's action selection and generate messages tailored to specific agents. It further introduces a novel regularization to leverage interaction sparsity and improve communication efficiency. MAIC is agnostic to specific MARL algorithms and can be flexibly integrated with different value function factorization methods. Empirical results demonstrate that the method significantly outperforms baselines and achieves excellent performance on multiple cooperative MARL tasks.
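The core idea in the abstract, that a message acts as a direct bias on the receiver's value function rather than as extra policy input, can be illustrated with a toy sketch. This is not the paper's architecture: the function name `biased_q_values`, the array shapes, and the hand-written message values are all assumptions made for illustration, and the learned teammate models and the sparsity regularization are not modeled here.

```python
import numpy as np


def biased_q_values(local_q, messages):
    """Add incoming incentive messages to each agent's local action values.

    local_q:  array of shape (n_agents, n_actions), each agent's own Q-values.
    messages: array of shape (n_agents, n_agents, n_actions), where
              messages[j, i] is the bias that sender j adds to receiver i.
    Both names and shapes are hypothetical, chosen only for this sketch.
    """
    n_agents = local_q.shape[0]
    # Ignore self-messages: an agent does not bias its own values.
    not_self = 1.0 - np.eye(n_agents)
    # Sum over senders (axis 0) to get one bias vector per receiver.
    incoming = (messages * not_self[:, :, None]).sum(axis=0)
    return local_q + incoming


# Two agents, two actions. Without communication, agent 1 prefers action 1.
local_q = np.array([[1.0, 0.0],
                    [0.0, 1.0]])

# Agent 0 sends agent 1 a message that raises the value of action 0.
messages = np.zeros((2, 2, 2))
messages[0, 1] = [2.0, 0.0]

greedy_before = local_q.argmax(axis=1)
greedy_after = biased_q_values(local_q, messages).argmax(axis=1)
```

In this toy case the message changes agent 1's greedy action from 1 to 0 while agent 0's choice is unaffected, and the input to each agent's value estimate keeps its original size.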


Pages: 9466-9474

Number of pages: 9

