Multi-Agent Incentive Communication via Decentralized Teammate Modeling

被引:0
|
作者
Yuan, Lei [1 ,3 ]
Wang, Jianhao [2 ]
Zhang, Fuxiang [1 ]
Wang, Chenghe [1 ]
Zhang, Zongzhang [1 ]
Yu, Yang [1 ,3 ,4 ]
Zhang, Chongjie [2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
[3] Polixir Technol, Nanjing 210000, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective communication can improve coordination in cooperative multi-agent reinforcement learning (MARL). One popular communication scheme is exchanging agents' local observations or latent embeddings and using them to augment individual local policy input. Such a communication paradigm can reduce uncertainty for local decision-making and induce implicit coordination. However, it enlarges agents' local policy spaces and increases learning complexity, leading to poor coordination in complex settings. To handle this limitation, this paper proposes a novel framework named Multi-Agent Incentive Communication (MAIC) that allows each agent to learn to generate incentive messages and bias other agents' value functions directly, resulting in effective explicit coordination. Our method firstly learns targeted teammate models, with which each agent can anticipate the teammate's action selection and generate tailored messages to specific agents. We further introduce a novel regularization to leverage interaction sparsity and improve communication efficiency. MAIC is agnostic to specific MARL algorithms and can be flexibly integrated with different value function factorization methods. Empirical results demonstrate that our method significantly outperforms baselines and achieves excellent performance on multiple cooperative MARL tasks.
引用
收藏
页码:9466 / 9474
页数:9
相关论文
共 50 条
  • [1] Multi-agent cooperative strategy with explicit teammate modeling and targeted informative communication
    Jiang, Rui
    Zhang, Xuetao
    Liu, Yisha
    Xu, Yi
    Zhang, Xuebo
    Zhuang, Yan
    NEUROCOMPUTING, 2024, 586
  • [2] Decentralized multi-agent cooperation via adaptive partner modeling
    Xu, Chenhang
    Wang, Jia
    Zhu, Xiaohui
    Yue, Yong
    Zhou, Weifeng
    Liang, Zhixuan
    Wojtczak, Dominik
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 4989 - 5004
  • [3] Decentralized communication strategies for coordinated multi-agent policies
    Roth, M
    Simmons, R
    Veloso, M
    Multi-Robot Systems - From Swarms to Intelligent Automata Vol III, 2005, : 93 - 105
  • [4] Periodic communication logics for the decentralized control of multi-agent systems
    Sun, YS
    Lemmon, MD
    2005 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS (CCA), VOLS 1AND 2, 2005, : 1431 - 1434
  • [5] Decentralized Localization for Multi-agent Systems Based on Asynchronous Communication
    Lu, Lei
    Hu, Jinwen
    Pan, Quan
    Zhao, Chunhui
    Xu, Zhao
    Jia, Caijuan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 5489 - 5494
  • [6] Decentralized Multi-agent Coordination under MITL Specifications and Communication Constraints
    Wang, Wei
    Schuppe, Georg Friedrich
    Tumova, Jana
    2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 842 - 849
  • [7] Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication
    Lidard, Justin
    Madhushani, Udari
    Leonard, Naomi Ehrich
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3311 - 3316
  • [8] Decentralized Communication Range Adjustment Issues in Multi-Agent Mobile Networks
    Stergiopoulos, John
    Tzes, Anthony
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 1629 - 1634
  • [9] Decentralized Multi-agent Coordination under MITL Tasks and Communication Constraints
    Wang, Wei
    Schuppe, Georg Friedrich
    Tumova, Jana
    2022 13TH ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS 2022), 2022, : 320 - 321
  • [10] Neighborhood-Oriented Decentralized Learning Communication in Multi-Agent System
    Dai, Hao
    Wu, Jiashu
    Brinkmann, Andre
    Wang, Yang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 490 - 502