Emergent cooperation from mutual acknowledgment exchange in multi-agent reinforcement learning

被引:1
|
作者
Phan, Thomy [1 ,2 ]
Sommer, Felix [2 ]
Ritz, Fabian [2 ]
Altmann, Philipp [2 ]
Nuesslein, Jonas [2 ]
Koelle, Michael [2 ]
Belzner, Lenz [3 ]
Linnhoff-Popien, Claudia [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Ludwig Maximilians Univ Munchen, Munich, Germany
[3] TH Ingolstadt, Ingolstadt, Germany
关键词
Multi-agent learning; Reinforcement learning; Mutual acknowledgments; Peer incentivization; Emergent cooperation; EVOLUTION; LEVEL;
D O I
10.1007/s10458-024-09666-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Peer incentivization (PI) is a recent approach where all agents learn to reward or penalize each other in a distributed fashion, which often leads to emergent cooperation. Current PI mechanisms implicitly assume a flawless communication channel in order to exchange rewards. These rewards are directly incorporated into the learning process without any chance to respond with feedback. Furthermore, most PI approaches rely on global information, which limits scalability and applicability to real-world scenarios where only local information is accessible. In this paper, we propose Mutual Acknowledgment Token Exchange (MATE), a PI approach defined by a two-phase communication protocol to exchange acknowledgment tokens as incentives to shape individual rewards mutually. All agents condition their token transmissions on the locally estimated quality of their own situations based on environmental rewards and received tokens. MATE is completely decentralized and only requires local communication and information. We evaluate MATE in three social dilemma domains. Our results show that MATE is able to achieve and maintain significantly higher levels of cooperation than previous PI approaches. In addition, we evaluate the robustness of MATE in more realistic scenarios, where agents can deviate from the protocol and communication failures can occur. We also evaluate the sensitivity of MATE w.r.t. the choice of token values.
引用
收藏
页数:36
相关论文
共 50 条
  • [41] Partitioning in multi-agent reinforcement learning
    Sun, R
    Peterson, T
    FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
  • [42] The Dynamics of Multi-Agent Reinforcement Learning
    Dickens, Luke
    Broda, Krysia
    Russo, Alessandra
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
  • [43] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
  • [44] Emergent Collective Behaviors in a Multi-agent Reinforcement Learning Pedestrian Simulation: A Case Study
    Martinez-Gil, Francisco
    Lozano, Miguel
    Fernandez, Fernando
    MULTI-AGENT-BASED SIMULATION XV, 2015, 9002 : 228 - 238
  • [45] Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models
    Martinez-Gil, Francisco
    Lozano, Miguel
    Fernandez, Fernando
    SIMULATION MODELLING PRACTICE AND THEORY, 2017, 74 : 117 - 133
  • [46] Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning
    Hahn, Carsten
    Phan, Thomy
    Gabor, Thomas
    Belzner, Lenz
    Linnhoff-Popien, Claudia
    ALIFE 2019: THE 2019 CONFERENCE ON ARTIFICIAL LIFE, 2019, : 598 - 605
  • [47] Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning
    Tian, Qi
    Kuang, Kun
    Liu, Furui
    Wang, Baoxiang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11672 - 11680
  • [48] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [49] Importance-Aware Message Exchange and Prediction for Multi-Agent Reinforcement Learning
    Huang, Xiufeng
    Zhou, Sheng
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 6493 - 6498
  • [50] Multi-agent deep reinforcement learning for online request scheduling in edge cooperation networks
    Zhang, Yaqiang
    Li, Ruyang
    Zhao, Yaqian
    Li, Rengang
    Wang, Yanwei
    Zhou, Zhangbing
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 141 : 258 - 268