Learning controlled and targeted communication with the centralized critic for the multi-agent system

被引:0
|
作者
Qingshuang Sun
Yuan Yao
Peng Yi
YuJiao Hu
Zhao Yang
Gang Yang
Xingshe Zhou
机构
[1] Northwestern Polytechnical University,School of Computer Science
[2] Purple Mountain Laboratories,Future network center
来源
Applied Intelligence | 2023年 / 53卷
关键词
Reinforcement learning; Centralized critic; Communication; Cooperation; Multi-agent system;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-agent deep reinforcement learning (MDRL) has attracted attention for solving complex tasks. Two main challenges of MDRL are non-stationarity and partial observability from the perspective of agents, impacting the performance of agents’ learning cooperative policies. In this study, Controlled and Targeted Communication with the Centralized Critic (COTAC) is proposed, thereby constructing the paradigm of centralized learning and decentralized execution with partial communication. It is capable of decoupling how the MAS obtains environmental information during training and execution. Specifically, COTAC can make the environment faced by agents to be stationarity in the training phase and learn partial communication to overcome the limitation of partial observability in the execution phase. Based on this, decentralized actors learn controlled and targeted communication and policies optimized by centralized critics during training. As a result, agents comprehensively learn when to communicate during the sending and how to target information aggregation during the receiving. Apart from that, COTAC is evaluated on two multi-agent scenarios with continuous space. Experimental results demonstrated that partial agents with important information choose to send messages and targeted aggregate received information by identifying the relevant important information, which can still have better cooperation performance while reducing the communication traffic of the system.
引用
收藏
页码:14819 / 14837
页数:18
相关论文
共 50 条
  • [1] Learning controlled and targeted communication with the centralized critic for the multi-agent system
    Sun, Qingshuang
    Yao, Yuan
    Yi, Peng
    Hu, YuJiao
    Yang, Zhao
    Yang, Gang
    Zhou, Xingshe
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 14819 - 14837
  • [2] Multi-agent actor centralized-critic with communication
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    [J]. NEUROCOMPUTING, 2020, 390 : 40 - 56
  • [3] Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System
    Sun, Qingshuang
    Yao, Yuan
    Yi, Peng
    Zhou, Xingshe
    Yang, Gang
    [J]. COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2021, PT II, 2022, 1492 : 134 - 146
  • [4] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [5] Targeted Multi-Agent Communication with Deep Metric Learning
    Miao, Hua
    Yu, Nanxiang
    [J]. ENGINEERING LETTERS, 2023, 31 (02) : 712 - 723
  • [6] Exploring communication protocols and centralized critics in multi-agent deep learning
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    [J]. INTEGRATED COMPUTER-AIDED ENGINEERING, 2020, 27 (04) : 333 - 351
  • [7] A Study for Comparative Analysis of Dueling DQN and Centralized Critic Approaches in Multi-Agent Reinforcement Learning
    Sugimoto, Masashi
    Hasegawa, Kaito
    Ishida, Yuuki
    Ohnishi, Rikuto
    Nakagami, Kouki
    Tsuzuki, Shinji
    Urushihara, Shiro
    Sori, Hitoshi
    [J]. JOURNAL OF ROBOTICS AND MECHATRONICS, 2024, 36 (03) : 589 - 602
  • [8] TarMAC: Targeted Multi-Agent Communication
    Das, Abhishek
    Gervet, Theophile
    Romoff, Joshua
    Batra, Dhruv
    Parikh, Devi
    Rabbat, Michael
    Pineau, Joelle
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [9] Centralized Critic per Knowledge for Cooperative Multi-Agent Game Environments
    Ferreira, Thais
    Clua, Esteban
    Kohwalter, Troy Costa
    [J]. 2021 20TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2021), 2021, : 39 - 48
  • [10] On Centralized Critics in Multi-Agent Reinforcement Learning
    Lyu, Xueguang
    Baisero, Andrea
    Xiao, Yuchen
    Daley, Brett
    Amato, Christopher
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 295 - 354