Learning Individually Inferred Communication for Multi-Agent Cooperation

Citations: 0

Authors:

Ding, Ziluo [1]

Huang, Tiejun [1]

Lu, Zongqing [1]

Affiliations:

[1] Peking Univ, Beijing, Peoples R China

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33

Keywords:

LEVEL;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Communication lays the foundation for human cooperation. It is also crucial for multi-agent cooperation. However, existing work focuses on broadcast communication, which is not only impractical but also leads to information redundancy that could even impair the learning process. To tackle these difficulties, we propose Individually Inferred Communication (I2C), a simple yet effective model to enable agents to learn a prior for agent-agent communication. The prior knowledge is learned via causal inference and realized by a feed-forward neural network that maps the agent's local observation to a belief about who to communicate with. The influence of one agent on another is inferred via the joint action-value function in multi-agent reinforcement learning and quantified to label the necessity of agent-agent communication. Furthermore, the agent policy is regularized to better exploit communicated messages. Empirically, we show that I2C can not only reduce communication overhead but also improve the performance in a variety of multi-agent cooperative scenarios, compared to existing methods.
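
The abstract says that the influence of one agent on another is inferred from the joint action-value function and quantified to label whether communication is necessary. The following is a minimal sketch of one plausible reading of that step, not the authors' implementation. It assumes two agents, a tabular joint Q for a single fixed state, Boltzmann (softmax) distributions over Q-values, and a KL-divergence threshold; the names `influence`, `communication_label`, `temperature`, and `threshold` are hypothetical.

```python
import math


def softmax(values, temperature=1.0):
    """Boltzmann distribution over a list of values (numerically stable)."""
    m = max(values)
    exps = [math.exp((v - m) / temperature) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps))
               for pi, qi in zip(p, q) if pi > 0)


def influence(q_table, a_j, temperature=1.0):
    """Assumed influence measure of agent j's action a_j on agent i.

    q_table[a_i][a_j] holds the joint action value for one fixed state.
    The measure compares agent i's action distribution conditioned on a_j
    with its distribution when a_j is marginalized out.
    """
    n_i, n_j = len(q_table), len(q_table[0])
    # Distribution over a_i given that agent j plays a_j.
    conditional = softmax([row[a_j] for row in q_table], temperature)
    # Joint distribution over (a_i, a_j), then sum out a_j.
    flat = softmax([q for row in q_table for q in row], temperature)
    marginal = [sum(flat[i * n_j:(i + 1) * n_j]) for i in range(n_i)]
    return kl_divergence(conditional, marginal)


def communication_label(q_table, a_j, threshold=0.1):
    """Label communication as necessary (1) if the influence exceeds the threshold."""
    return 1 if influence(q_table, a_j) > threshold else 0


# Additive Q: agent j's action does not change agent i's preferences.
independent_q = [[1.0, 2.0], [3.0, 4.0]]
# Coordination Q: agent i's best action depends on agent j's action.
coupled_q = [[10.0, 0.0], [0.0, 10.0]]

print(communication_label(independent_q, a_j=0))
print(communication_label(coupled_q, a_j=0))
```

Under these assumptions, labels of this kind could serve as supervised targets for the feed-forward prior network that maps a local observation to a belief about whom to communicate with.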


Pages: 11
