Multi-Agent Incentive Communication via Decentralized Teammate Modeling

被引：0

作者：

Yuan, Lei ^{[1
,3
]}

Wang, Jianhao ^{[2
]}

Zhang, Fuxiang ^{[1
]}

Wang, Chenghe ^{[1
]}

Zhang, Zongzhang ^{[1
]}

Yu, Yang ^{[1
,3
,4
]}

Zhang, Chongjie ^{[2
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China

[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China

[3] Polixir Technol, Nanjing 210000, Peoples R China

[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China

来源：

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Effective communication can improve coordination in cooperative multi-agent reinforcement learning (MARL). One popular communication scheme is exchanging agents' local observations or latent embeddings and using them to augment individual local policy input. Such a communication paradigm can reduce uncertainty for local decision-making and induce implicit coordination. However, it enlarges agents' local policy spaces and increases learning complexity, leading to poor coordination in complex settings. To handle this limitation, this paper proposes a novel framework named Multi-Agent Incentive Communication (MAIC) that allows each agent to learn to generate incentive messages and bias other agents' value functions directly, resulting in effective explicit coordination. Our method firstly learns targeted teammate models, with which each agent can anticipate the teammate's action selection and generate tailored messages to specific agents. We further introduce a novel regularization to leverage interaction sparsity and improve communication efficiency. MAIC is agnostic to specific MARL algorithms and can be flexibly integrated with different value function factorization methods. Empirical results demonstrate that our method significantly outperforms baselines and achieves excellent performance on multiple cooperative MARL tasks.

引用

页码：9466 / 9474

页数：9

共 50 条

[41] Decentralized control of connectivity for multi-agent systems
De Gennaro, Maria Carmela
Jadbabaie, Ali
PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 3631 - +
[42] Centralized vs Decentralized Multi-Agent Guesswork
Salamatian, Salman
Beirami, Ahmad
Cohen, Asaf
Medard, Muriel
2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 2258 - 2262
[43] Multi-Agent Coordination by Decentralized Estimation and Control
Yang, Peng
Freeman, Randy A.
Lynch, Kevin M.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (11) : 2480 - 2496
[44] A Decentralized Multi-Agent Path Planning Approach Based on Imitation Learning and Selective Communication
Feng, Bohan
Bi, Youyi
Li, Mian
Lin, Liyong
JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2024, 24 (08)
[45] Efficient agent communication in multi-agent systems
Jang, MW
Ahmed, A
Agha, G
SOFTWARE ENGINEERING FOR MULTI-AGENT SYSTEMS III: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2004, 3390 : 236 - 253
[46] Urban Traffic Light Control via Active Multi-Agent Communication and Supply-Demand Modeling
Guo, Xin
Yu, Zhengxu
Wang, Pengfei
Jin, Zhongming
Huang, Jianqiang
Cai, Deng
He, Xiaofei
Hua, Xian-Sheng
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4346 - 4356
[47] Multi-agent policy transfer via task relationship modeling
Qin, Rongjun
Chen, Feng
Wang, Tonghan
Yuan, Lei
Wu, Xiaoran
Kang, Yipeng
Zhang, Zongzhang
Zhang, Chongjie
Yu, Yang
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (08)
[48] Multi-agent policy transfer via task relationship modeling
Rongjun QIN
Feng CHEN
Tonghan WANG
Lei YUAN
Xiaoran WU
Yipeng KANG
Zongzhang ZHANG
Chongjie ZHANG
Yang YU
Science China(Information Sciences), 2024, 67 (08) : 102 - 114
[49] Decentralized MCTS via Learned Teammate Models
Czechowski, Aleksander
Oliehoek, Frans A.
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 81 - 88
[50] Behavior modeling based on multi-agent and multi-agent simulation environment
Yin, QJ
Du, XY
Huang, K
SYSTEM SIMULATION AND SCIENTIFIC COMPUTING, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1531 - 1536

← 1 2 3 4 5 →