Multi-Agent Incentive Communication via Decentralized Teammate Modeling

被引:0
|
作者
Yuan, Lei [1 ,3 ]
Wang, Jianhao [2 ]
Zhang, Fuxiang [1 ]
Wang, Chenghe [1 ]
Zhang, Zongzhang [1 ]
Yu, Yang [1 ,3 ,4 ]
Zhang, Chongjie [2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
[3] Polixir Technol, Nanjing 210000, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effective communication can improve coordination in cooperative multi-agent reinforcement learning (MARL). One popular communication scheme is exchanging agents' local observations or latent embeddings and using them to augment individual local policy input. Such a communication paradigm can reduce uncertainty for local decision-making and induce implicit coordination. However, it enlarges agents' local policy spaces and increases learning complexity, leading to poor coordination in complex settings. To handle this limitation, this paper proposes a novel framework named Multi-Agent Incentive Communication (MAIC) that allows each agent to learn to generate incentive messages and bias other agents' value functions directly, resulting in effective explicit coordination. Our method firstly learns targeted teammate models, with which each agent can anticipate the teammate's action selection and generate tailored messages to specific agents. We further introduce a novel regularization to leverage interaction sparsity and improve communication efficiency. MAIC is agnostic to specific MARL algorithms and can be flexibly integrated with different value function factorization methods. Empirical results demonstrate that our method significantly outperforms baselines and achieves excellent performance on multiple cooperative MARL tasks.
引用
收藏
页码:9466 / 9474
页数:9
相关论文
共 50 条
  • [41] Decentralized control of connectivity for multi-agent systems
    De Gennaro, Maria Carmela
    Jadbabaie, Ali
    PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 3631 - +
  • [42] Centralized vs Decentralized Multi-Agent Guesswork
    Salamatian, Salman
    Beirami, Ahmad
    Cohen, Asaf
    Medard, Muriel
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 2258 - 2262
  • [43] Multi-Agent Coordination by Decentralized Estimation and Control
    Yang, Peng
    Freeman, Randy A.
    Lynch, Kevin M.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (11) : 2480 - 2496
  • [44] A Decentralized Multi-Agent Path Planning Approach Based on Imitation Learning and Selective Communication
    Feng, Bohan
    Bi, Youyi
    Li, Mian
    Lin, Liyong
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2024, 24 (08)
  • [45] Efficient agent communication in multi-agent systems
    Jang, MW
    Ahmed, A
    Agha, G
    SOFTWARE ENGINEERING FOR MULTI-AGENT SYSTEMS III: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2004, 3390 : 236 - 253
  • [46] Urban Traffic Light Control via Active Multi-Agent Communication and Supply-Demand Modeling
    Guo, Xin
    Yu, Zhengxu
    Wang, Pengfei
    Jin, Zhongming
    Huang, Jianqiang
    Cai, Deng
    He, Xiaofei
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4346 - 4356
  • [47] Multi-agent policy transfer via task relationship modeling
    Qin, Rongjun
    Chen, Feng
    Wang, Tonghan
    Yuan, Lei
    Wu, Xiaoran
    Kang, Yipeng
    Zhang, Zongzhang
    Zhang, Chongjie
    Yu, Yang
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (08)
  • [48] Multi-agent policy transfer via task relationship modeling
    Rongjun QIN
    Feng CHEN
    Tonghan WANG
    Lei YUAN
    Xiaoran WU
    Yipeng KANG
    Zongzhang ZHANG
    Chongjie ZHANG
    Yang YU
    Science China(Information Sciences), 2024, 67 (08) : 102 - 114
  • [49] Decentralized MCTS via Learned Teammate Models
    Czechowski, Aleksander
    Oliehoek, Frans A.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 81 - 88
  • [50] Behavior modeling based on multi-agent and multi-agent simulation environment
    Yin, QJ
    Du, XY
    Huang, K
    SYSTEM SIMULATION AND SCIENTIFIC COMPUTING, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1531 - 1536