Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning

被引:0
|
作者
Hong, Seung-Jin [1 ]
Lee, Sang-Kwang [2 ]
机构
[1] Univ Sci & Technol, Sch ICT, Daejeon, South Korea
[2] Elect & Telecommun Res Inst, Daejeon, South Korea
关键词
D O I
10.1109/ICTC52510.2021.9620745
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The cooperative behavior is important skill in many real-world applications. Recently, many works have used the multi-agent platform to solve the real-world applications. However, it is difficult to learn the cooperative behaviors with equal rewards that the environment provides without considering the contributions. In this paper, we propose a method for learning cooperative behaviors in the centralized multi-agent environment. Firstly, we implement a reward model to predict the average rewards of all agents. And then, we use the reward model for calculating the contributions. The proposed method allows the model to distinguish which agent behaves better for team success. In order to evaluate the performance of the proposed method, we compute the average team rewards on the multi-agent battle environment. Experimental results show that the proposed method has better performance than the baseline using the cooperative behaviors.
引用
收藏
页码:1697 / 1699
页数:3
相关论文
共 50 条
  • [31] Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
    Zhou, Meng
    Liu, Ziyu
    Sui, Pengwei
    Li, Yixuan
    Chung, Yuk Ying
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [32] Enhanced Cooperative Multi-agent Learning Algorithms (ECMLA) using Reinforcement Learning
    Vidhate, Deepak A.
    Kulkarni, Parag
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 556 - 561
  • [33] LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
    Yang, Mingyu
    Zhao, Jian
    Hu, Xunhan
    Zhou, Wengang
    Zhu, Jiangcheng
    Li, Houqiang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [34] Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
    Li, Chao
    Zhang, Yupeng
    Wang, Jianqi
    Hu, Yujing
    Dong, Shaokang
    Li, Wenbin
    Lv, Tangjie
    Fan, Changjie
    Gao, Yang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17453 - 17460
  • [35] Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
    Mu, Ronghui
    Ruan, Wenjie
    Marcolino, Leandro Soriano
    Jin, Gaojie
    Ni, Qiang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15046 - 15054
  • [36] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    [J]. Applied Intelligence, 2023, 53 : 9261 - 9269
  • [37] Cooperative targets assignment based on multi-agent reinforcement learning
    Ma, Yue
    Wu, Lin
    Xu, Xiao
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (09): : 2793 - 2801
  • [38] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
    Ocana, Jim Martin Catacora
    Riccio, Francesco
    Capobianco, Roberto
    Nardi, Daniele
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
  • [39] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
    Javalera-Rincon, Valeria
    Puig Cayuela, Vicenc
    Morcego Seix, Bernardo
    Orduna-Cabrera, Fernando
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
  • [40] Transform networks for cooperative multi-agent deep reinforcement learning
    Wang, Hongbin
    Xie, Xiaodong
    Zhou, Lianke
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269