Learning Cooperative Intrinsic Motivation in Multi-Agent Reinforcement Learning

被引：0

作者：

Hong, Seung-Jin ^{[1
]}

Lee, Sang-Kwang ^{[2
]}

机构：

[1] Univ Sci & Technol, Sch ICT, Daejeon, South Korea

[2] Elect & Telecommun Res Inst, Daejeon, South Korea

来源：

12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION | 2021年

关键词：

D O I：

10.1109/ICTC52510.2021.9620745

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The cooperative behavior is important skill in many real-world applications. Recently, many works have used the multi-agent platform to solve the real-world applications. However, it is difficult to learn the cooperative behaviors with equal rewards that the environment provides without considering the contributions. In this paper, we propose a method for learning cooperative behaviors in the centralized multi-agent environment. Firstly, we implement a reward model to predict the average rewards of all agents. And then, we use the reward model for calculating the contributions. The proposed method allows the model to distinguish which agent behaves better for team success. In order to evaluate the performance of the proposed method, we compute the average team rewards on the multi-agent battle environment. Experimental results show that the proposed method has better performance than the baseline using the cooperative behaviors.

引用

页码：1697 / 1699

页数：3

共 50 条

[31] Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
Zhou, Meng
Liu, Ziyu
Sui, Pengwei
Li, Yixuan
Chung, Yuk Ying
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[32] Enhanced Cooperative Multi-agent Learning Algorithms (ECMLA) using Reinforcement Learning
Vidhate, Deepak A.
Kulkarni, Parag
[J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 556 - 561
[33] LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
Yang, Mingyu
Zhao, Jian
Hu, Xunhan
Zhou, Wengang
Zhu, Jiangcheng
Li, Houqiang
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[34] Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
Li, Chao
Zhang, Yupeng
Wang, Jianqi
Hu, Yujing
Dong, Shaokang
Li, Wenbin
Lv, Tangjie
Fan, Changjie
Gao, Yang
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17453 - 17460
[35] Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
Mu, Ronghui
Ruan, Wenjie
Marcolino, Leandro Soriano
Jin, Gaojie
Ni, Qiang
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15046 - 15054
[36] Transform networks for cooperative multi-agent deep reinforcement learning
Hongbin Wang
Xiaodong Xie
Lianke Zhou
[J]. Applied Intelligence, 2023, 53 : 9261 - 9269
[37] Cooperative targets assignment based on multi-agent reinforcement learning
Ma, Yue
Wu, Lin
Xu, Xiao
[J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (09): : 2793 - 2801
[38] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
Ocana, Jim Martin Catacora
Riccio, Francesco
Capobianco, Roberto
Nardi, Daniele
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
[39] Reinforcement Learning Approach for Cooperative Control of Multi-Agent Systems
Javalera-Rincon, Valeria
Puig Cayuela, Vicenc
Morcego Seix, Bernardo
Orduna-Cabrera, Fernando
[J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 80 - 91
[40] Transform networks for cooperative multi-agent deep reinforcement learning
Wang, Hongbin
Xie, Xiaodong
Zhou, Lianke
[J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 9261 - 9269

← 1 2 3 4 5 →