Centralized reinforcement learning for multi-agent cooperative environments

被引：3

作者：

Lu, Chengxuan ^{[1
]}

Bao, Qihao ^{[1
]}

Xia, Shaojie ^{[1
]}

Qu, Chongxiao ^{[1
]}

机构：

[1] 52Nd Res Inst China Elect Technol Grp Corp, 9 Wenfu Rd, Hangzhou 311100, Peoples R China

来源：

EVOLUTIONARY INTELLIGENCE | 2024年 / 17卷 / 01期

关键词：

Centralized; Reinforcement Learning; Attention; Multi-Agent; Combinatorial Explosion; LEVEL;

D O I：

10.1007/s12065-022-00703-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study reinforcement learning methods in multi-agent domains where a central controller collects all information and decides an action for every agent. However, multi-agent reinforcement learning (MARL) suffers from the combinatorial explosion of action space. In this work, we propose an improved proximal policy optimization (PPO) algorithm, whose neural network is based on attention mechanism, to solve the combinatorial explosion issue. Our model outputs joint-action instead of distributed action. Parameter sharing of attention mechanism enables the size of neural network linearly with local observation's length of single agent regardless of the agents' number. Besides, credit assignment of multi-agent is naturally addressed by gradient ascent in the attention layer. Experiment results demonstrate that our method outperforms independent PPO and centralized PPO with other networks.

引用

页码：267 / 273

页数：7

共 50 条

[1] Centralized reinforcement learning for multi-agent cooperative environments
Chengxuan Lu
Qihao Bao
Shaojie Xia
Chongxiao Qu
[J]. Evolutionary Intelligence, 2024, 17 : 267 - 273
[2] On Centralized Critics in Multi-Agent Reinforcement Learning
Lyu, Xueguang
Baisero, Andrea
Xiao, Yuchen
Daley, Brett
Amato, Christopher
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 295 - 354
[3] On Centralized Critics in Multi-Agent Reinforcement Learning
Lyu, Xueguang
Baisero, Andrea
Xiao, Yuchen
Daley, Brett
Amato, Christopher
[J]. Journal of Artificial Intelligence Research, 2023, 77 : 295 - 354
[4] SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning
Wen, Chao
Yao, Xinghu
Wang, Yuhui
Tan, Xiaoyang
[J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7301 - 7308
[5] An Efficient Centralized Multi-Agent Reinforcement Learner for Cooperative Tasks
Liao, Dengyu
Zhang, Zhen
Song, Tingting
Liu, Mingyang
[J]. IEEE ACCESS, 2023, 11 : 139284 - 139294
[6] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
Chen, Hao
Yang, Guangkai
Zhang, Junge
Yin, Qiyue
Huang, Kaiqi
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[7] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
Lin, Jieyu
Dzeparoska, Kristina
Zhang, Sai Qian
Leon-Garcia, Alberto
Papernot, Nicolas
[J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
[8] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
Mahnoosh Mahdavimoghadam
Amin Nikanjam
Monireh Abdoos
[J]. The Journal of Supercomputing, 2022, 78 : 10455 - 10479
[9] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
Mahdavimoghadam, Mahnoosh
Nikanjam, Amin
Abdoos, Monireh
[J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (08): : 10455 - 10479
[10] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734

← 1 2 3 4 5 →