Learning controlled and targeted communication with the centralized critic for the multi-agent system

被引：0

作者：

Qingshuang Sun

Yuan Yao

Peng Yi

YuJiao Hu

Zhao Yang

Gang Yang

Xingshe Zhou

机构：

[1] Northwestern Polytechnical University,School of Computer Science

[2] Purple Mountain Laboratories,Future network center

来源：

Applied Intelligence | 2023年 / 53卷

关键词：

Reinforcement learning; Centralized critic; Communication; Cooperation; Multi-agent system;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Multi-agent deep reinforcement learning (MDRL) has attracted attention for solving complex tasks. Two main challenges of MDRL are non-stationarity and partial observability from the perspective of agents, impacting the performance of agents’ learning cooperative policies. In this study, Controlled and Targeted Communication with the Centralized Critic (COTAC) is proposed, thereby constructing the paradigm of centralized learning and decentralized execution with partial communication. It is capable of decoupling how the MAS obtains environmental information during training and execution. Specifically, COTAC can make the environment faced by agents to be stationarity in the training phase and learn partial communication to overcome the limitation of partial observability in the execution phase. Based on this, decentralized actors learn controlled and targeted communication and policies optimized by centralized critics during training. As a result, agents comprehensively learn when to communicate during the sending and how to target information aggregation during the receiving. Apart from that, COTAC is evaluated on two multi-agent scenarios with continuous space. Experimental results demonstrated that partial agents with important information choose to send messages and targeted aggregate received information by identifying the relevant important information, which can still have better cooperation performance while reducing the communication traffic of the system.

引用

页码：14819 / 14837

页数：18

共 50 条

[1] Learning controlled and targeted communication with the centralized critic for the multi-agent system
Sun, Qingshuang
Yao, Yuan
Yi, Peng
Hu, YuJiao
Yang, Zhao
Yang, Gang
Zhou, Xingshe
[J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14819 - 14837
[2] Multi-agent actor centralized-critic with communication
Simoes, David
Lau, Nuno
Reis, Luis Paulo
[J]. NEUROCOMPUTING, 2020, 390 : 40 - 56
[3] Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System
Sun, Qingshuang
Yao, Yuan
Yi, Peng
Zhou, Xingshe
Yang, Gang
[J]. COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2021, PT II, 2022, 1492 : 134 - 146
[4] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
[J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
[5] Targeted Multi-Agent Communication with Deep Metric Learning
Miao, Hua
Yu, Nanxiang
[J]. ENGINEERING LETTERS, 2023, 31 (02) : 712 - 723
[6] Exploring communication protocols and centralized critics in multi-agent deep learning
Simoes, David
Lau, Nuno
Reis, Luis Paulo
[J]. INTEGRATED COMPUTER-AIDED ENGINEERING, 2020, 27 (04) : 333 - 351
[7] A Study for Comparative Analysis of Dueling DQN and Centralized Critic Approaches in Multi-Agent Reinforcement Learning
Sugimoto, Masashi
Hasegawa, Kaito
Ishida, Yuuki
Ohnishi, Rikuto
Nakagami, Kouki
Tsuzuki, Shinji
Urushihara, Shiro
Sori, Hitoshi
[J]. JOURNAL OF ROBOTICS AND MECHATRONICS, 2024, 36 (03) : 589 - 602
[8] TarMAC: Targeted Multi-Agent Communication
Das, Abhishek
Gervet, Theophile
Romoff, Joshua
Batra, Dhruv
Parikh, Devi
Rabbat, Michael
Pineau, Joelle
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[9] Centralized Critic per Knowledge for Cooperative Multi-Agent Game Environments
Ferreira, Thais
Clua, Esteban
Kohwalter, Troy Costa
[J]. 2021 20TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2021), 2021, : 39 - 48
[10] On Centralized Critics in Multi-Agent Reinforcement Learning
Lyu, Xueguang
Baisero, Andrea
Xiao, Yuchen
Daley, Brett
Amato, Christopher
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 295 - 354

← 1 2 3 4 5 →