Centralized reinforcement learning for multi-agent cooperative environments

被引:3
|
作者
Lu, Chengxuan [1 ]
Bao, Qihao [1 ]
Xia, Shaojie [1 ]
Qu, Chongxiao [1 ]
机构
[1] 52Nd Res Inst China Elect Technol Grp Corp, 9 Wenfu Rd, Hangzhou 311100, Peoples R China
关键词
Centralized; Reinforcement Learning; Attention; Multi-Agent; Combinatorial Explosion; LEVEL;
D O I
10.1007/s12065-022-00703-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study reinforcement learning methods in multi-agent domains where a central controller collects all information and decides an action for every agent. However, multi-agent reinforcement learning (MARL) suffers from the combinatorial explosion of action space. In this work, we propose an improved proximal policy optimization (PPO) algorithm, whose neural network is based on attention mechanism, to solve the combinatorial explosion issue. Our model outputs joint-action instead of distributed action. Parameter sharing of attention mechanism enables the size of neural network linearly with local observation's length of single agent regardless of the agents' number. Besides, credit assignment of multi-agent is naturally addressed by gradient ascent in the attention layer. Experiment results demonstrate that our method outperforms independent PPO and centralized PPO with other networks.
引用
收藏
页码:267 / 273
页数:7
相关论文
共 50 条
  • [1] Centralized reinforcement learning for multi-agent cooperative environments
    Chengxuan Lu
    Qihao Bao
    Shaojie Xia
    Chongxiao Qu
    [J]. Evolutionary Intelligence, 2024, 17 : 267 - 273
  • [2] On Centralized Critics in Multi-Agent Reinforcement Learning
    Lyu, Xueguang
    Baisero, Andrea
    Xiao, Yuchen
    Daley, Brett
    Amato, Christopher
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 295 - 354
  • [3] On Centralized Critics in Multi-Agent Reinforcement Learning
    Lyu, Xueguang
    Baisero, Andrea
    Xiao, Yuchen
    Daley, Brett
    Amato, Christopher
    [J]. Journal of Artificial Intelligence Research, 2023, 77 : 295 - 354
  • [4] SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning
    Wen, Chao
    Yao, Xinghu
    Wang, Yuhui
    Tan, Xiaoyang
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7301 - 7308
  • [5] An Efficient Centralized Multi-Agent Reinforcement Learner for Cooperative Tasks
    Liao, Dengyu
    Zhang, Zhen
    Song, Tingting
    Liu, Mingyang
    [J]. IEEE ACCESS, 2023, 11 : 139284 - 139294
  • [6] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [7] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    [J]. 2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
  • [8] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
    Mahnoosh Mahdavimoghadam
    Amin Nikanjam
    Monireh Abdoos
    [J]. The Journal of Supercomputing, 2022, 78 : 10455 - 10479
  • [9] Improved reinforcement learning in cooperative multi-agent environments using knowledge transfer
    Mahdavimoghadam, Mahnoosh
    Nikanjam, Amin
    Abdoos, Monireh
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (08): : 10455 - 10479
  • [10] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734