Learning Individually Inferred Communication for Multi-Agent Cooperation

Cited by: 0
Authors
Ding, Ziluo [1]
Huang, Tiejun [1]
Lu, Zongqing [1]
Affiliations
[1] Peking University, Beijing, People's Republic of China
Keywords
LEVEL;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Communication lays the foundation for human cooperation. It is also crucial for multi-agent cooperation. However, existing work focuses on broadcast communication, which is not only impractical but also leads to information redundancy that could even impair the learning process. To tackle these difficulties, we propose Individually Inferred Communication (I2C), a simple yet effective model that enables agents to learn a prior for agent-agent communication. The prior knowledge is learned via causal inference and realized by a feed-forward neural network that maps the agent's local observation to a belief about whom to communicate with. The influence of one agent on another is inferred via the joint action-value function in multi-agent reinforcement learning and quantified to label the necessity of agent-agent communication. Furthermore, the agent policy is regularized to better exploit communicated messages. Empirically, we show that I2C can not only reduce communication overhead but also improve performance in a variety of multi-agent cooperative scenarios, compared with existing methods.
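The abstract's idea of quantifying one agent's influence on another from a joint action-value function can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from this record: the table `q_table`, the Boltzmann (softmax) mapping from values to action probabilities, the use of KL divergence as the influence measure, and the `threshold` used to produce a communicate / do-not-communicate label.

```python
import math


def softmax(values, temperature=1.0):
    """Boltzmann distribution over a flat list of action values."""
    scaled = [v / temperature for v in values]
    top = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def influence(q_table, a_j, temperature=1.0):
    """Hypothetical influence of agent j's action a_j on agent i.

    q_table[a_i][a_j] is an assumed joint action value for one fixed
    observation, with every other agent's action held fixed.
    """
    n_i, n_j = len(q_table), len(q_table[0])
    # Joint distribution over (a_i, a_j) induced by the joint values.
    flat = softmax([q_table[ai][aj] for ai in range(n_i) for aj in range(n_j)],
                   temperature)
    joint = [flat[ai * n_j:(ai + 1) * n_j] for ai in range(n_i)]
    # Distribution over a_i when a_j is unknown (a_j marginalised out).
    marginal = [sum(row) for row in joint]
    # Distribution over a_i when a_j is known.
    column_mass = sum(joint[ai][a_j] for ai in range(n_i))
    conditional = [joint[ai][a_j] / column_mass for ai in range(n_i)]
    return kl_divergence(conditional, marginal)


def needs_communication(q_table, a_j, threshold=0.1, temperature=1.0):
    """Binary label: is knowing a_j worth a message to agent i?"""
    return influence(q_table, a_j, temperature) > threshold
```

In this sketch, a table in which agent j's action only shifts every value by the same amount yields an influence near zero, while a table in which the best action for agent i depends on agent j's action yields a clearly positive influence. Labels of this kind could then serve as training targets for a network that maps a local observation to a communication belief, as the abstract describes.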
Pages: 11
Related papers
50 in total
  • [1] Learning Attentional Communication for Multi-Agent Cooperation
    Jiang, Jiechuan
    Lu, Zongqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Learning multi-agent cooperation
    Rivera, Corban
    Staley, Edward
    Llorens, Ashley
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [3] Simultaneous Policy and Discrete Communication Learning for Multi-Agent Cooperation
    Freed, Benjamin
    Sartoretti, Guillaume
    Choset, Howie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02): 2498-2505
  • [4] Sparse Discrete Communication Learning for Multi-Agent Cooperation Through Backpropagation
    Freed, Benjamin
    James, Rohan
    Sartoretti, Guillaume
    Choset, Howie
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020: 7993-7998
  • [5] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
  • [6] Knowledge-guided communication preference learning model for multi-agent cooperation
    Zhang, Han
    Yu, Hang
    Wang, Xiaoming
    Wang, Mengke
    Zhang, Zhenyu
    Li, Yang
    Xie, Shaorong
    Luo, Xiangfeng
    INFORMATION SCIENCES, 2024, 667
  • [7] Analysis about efficiency of indirect media communication on multi-agent cooperation learning
    Zhao, Gang
    Sun, Ruoying
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006: 4180+
  • [8] Multi-agent communication cooperation based on deep reinforcement learning and information theory
    Gao, Bing
    Zhang, Zhejie
    Zou, Qijie
    Liu, Zhiguo
    Zhao, Xiling
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (18)
  • [9] ACM: Learning Dynamic Multi-agent Cooperation via Attentional Communication Model
    Han, Xue
    Yan, Hongping
    Zhang, Junge
    Wang, Lingfeng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140: 219-229
  • [10] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023: 2915-2920