Learning controlled and targeted communication with the centralized critic for the multi-agent system

Authors
Qingshuang Sun
Yuan Yao
Peng Yi
YuJiao Hu
Zhao Yang
Gang Yang
Xingshe Zhou
Affiliations
[1] Northwestern Polytechnical University, School of Computer Science
[2] Purple Mountain Laboratories, Future Network Center
Source
Applied Intelligence | 2023 / Vol. 53
Keywords
Reinforcement learning; Centralized critic; Communication; Cooperation; Multi-agent system
DOI
Not available
Abstract
Multi-agent deep reinforcement learning (MDRL) has attracted attention for solving complex tasks. Two main challenges of MDRL are non-stationarity and partial observability from the agents' perspective, both of which impair the agents' ability to learn cooperative policies. This study proposes Controlled and Targeted Communication with the Centralized Critic (COTAC), which establishes a paradigm of centralized learning and decentralized execution with partial communication. COTAC decouples how the multi-agent system (MAS) obtains environmental information during training from how it obtains that information during execution. Specifically, COTAC makes the environment faced by agents stationary in the training phase, and it learns partial communication to overcome the limitation of partial observability in the execution phase. On this basis, decentralized actors learn controlled and targeted communication together with policies, all optimized by centralized critics during training. As a result, agents learn both when to communicate on the sending side and how to aggregate information in a targeted way on the receiving side. COTAC is evaluated on two multi-agent scenarios with continuous space. Experimental results demonstrate that only a subset of agents, those holding important information, choose to send messages, and that receivers aggregate the received information in a targeted way by identifying the relevant important parts. The system thereby still achieves better cooperation performance while reducing its communication traffic.
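The abstract names two mechanisms, deciding when to send and aggregating received messages in a targeted way, but does not specify how either is built. The following is a minimal illustrative sketch under stated assumptions, not the authors' implementation: the sending decision is modeled as a thresholded gate value, and the targeted aggregation is modeled as scaled dot-product attention over the messages that were actually sent. The function name `communicate`, its parameters, and the `threshold` default are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def communicate(gates, keys, values, queries, threshold=0.5):
    """One communication round (illustrative assumption, not COTAC itself).

    gates[i]   -- gate value in [0, 1] for agent i; stands in for the
                  output of a learned "when to send" module (assumed)
    keys[i]    -- signature vector attached to agent i's message (assumed)
    values[i]  -- content vector of agent i's message (assumed)
    queries[j] -- query vector of receiving agent j (assumed)

    Returns (aggregated, n_sent): one aggregated vector per agent, and
    the number of agents that sent a message this round.
    """
    # Controlled sending: only agents whose gate exceeds the threshold send.
    senders = [i for i, g in enumerate(gates) if g > threshold]
    dim = len(values[0])
    aggregated = []
    for j, q in enumerate(queries):
        # An agent does not receive its own message.
        sources = [i for i in senders if i != j]
        if not sources:
            # Nothing received: fall back to a zero vector.
            aggregated.append([0.0] * dim)
            continue
        # Targeted receiving: attention weights from query-key similarity.
        scale = math.sqrt(len(q))
        weights = softmax([dot(q, keys[i]) / scale for i in sources])
        aggregated.append(
            [sum(w * values[i][d] for w, i in zip(weights, sources))
             for d in range(dim)]
        )
    return aggregated, len(senders)
```

In this sketch, communication traffic corresponds to `n_sent`: lowering the number of open gates reduces the messages exchanged, while the attention weights let each receiver emphasize the messages most relevant to its own query. In a trained system the gate values, keys, values, and queries would come from learned networks, which are omitted here.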
Pages: 14819-14837
Page count: 18