Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning

被引:0
|
作者
Zhang, Yanqiang [1 ]
Feng, Dawei [1 ]
Ding, Bo [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China
关键词
Multi-agent Systems; Reinforcement Learning; Intrinsic Reward;
D O I
10.1007/978-981-99-8082-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent reinforcement learning methods have shown significant progress, however, they continue to exhibit exploration problems in complex and challenging environments. To address the above issue, current research has introduced several exploration-enhanced methods for multi-agent reinforcement learning, they are still faced with the issues of inefficient exploration and low performance in challenging tasks that necessitate complex cooperation among agents. This paper proposes the prediction-action Qmix (PQmix) method, an action prediction-based multi-agent intrinsic reward construction approach. The PQmix method employs the joint local observation of agents and the next joint local observation after executing actions to predict the real joint action of agents. The method calculates the action prediction error as the intrinsic reward to measure the novel of the joint state and encourages agents to actively explore the action and state spaces in the environment. We compare PQmix with strong baselines on the MARL benchmark to validate it. The result of experiments demonstrates that PQmix outperforms the state-of-the-art algorithms on the StarCraft Multi-Agent Challenge (SMAC). In the end, the stability of the method is verified by experiments.
引用
收藏
页码:358 / 372
页数:15
相关论文
共 50 条
  • [41] Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
    Wan, Kejia
    Xu, Xinhai
    Li, Yuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 544 - 555
  • [42] Pacesetter Learning for Large Scale Cooperative Multi-Agent Reinforcement Learning
    Zhou, Pingqi
    Li, Chao
    Qiu, Mengwei
    Liu, Jun
    Ma, Chennan
    Yan, Ming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 115 - 126
  • [43] Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
    Zimmer, Matthieu
    Glanois, Claire
    Siddique, Umer
    Weng, Paul
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [44] QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement learning
    Son, Kyunghwan
    Kim, Daewoo
    Kang, Wan Ju
    Hostallero, David
    Yi, Yung
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [45] Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
    Zhou, Meng
    Liu, Ziyu
    Sui, Pengwei
    Li, Yixuan
    Chung, Yuk Ying
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [46] Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
    Mu, Ronghui
    Ruan, Wenjie
    Marcolino, Leandro Soriano
    Jin, Gaojie
    Ni, Qiang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 15046 - 15054
  • [47] Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning
    Li, Chao
    Zhang, Yupeng
    Wang, Jianqi
    Hu, Yujing
    Dong, Shaokang
    Li, Wenbin
    Lv, Tangjie
    Fan, Changjie
    Gao, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17453 - 17460
  • [48] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
    Ocana, Jim Martin Catacora
    Riccio, Francesco
    Capobianco, Roberto
    Nardi, Daniele
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
  • [49] Cooperative targets assignment based on multi-agent reinforcement learning
    Ma Y.
    Wu L.
    Xu X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (09): : 2793 - 2801
  • [50] Transform networks for cooperative multi-agent deep reinforcement learning
    Hongbin Wang
    Xiaodong Xie
    Lianke Zhou
    Applied Intelligence, 2023, 53 : 9261 - 9269