Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning

被引:0
|
作者
Zhang, Yanqiang [1 ]
Feng, Dawei [1 ]
Ding, Bo [1 ]
机构
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China
关键词
Multi-agent Systems; Reinforcement Learning; Intrinsic Reward;
D O I
10.1007/978-981-99-8082-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent reinforcement learning methods have shown significant progress, however, they continue to exhibit exploration problems in complex and challenging environments. To address the above issue, current research has introduced several exploration-enhanced methods for multi-agent reinforcement learning, they are still faced with the issues of inefficient exploration and low performance in challenging tasks that necessitate complex cooperation among agents. This paper proposes the prediction-action Qmix (PQmix) method, an action prediction-based multi-agent intrinsic reward construction approach. The PQmix method employs the joint local observation of agents and the next joint local observation after executing actions to predict the real joint action of agents. The method calculates the action prediction error as the intrinsic reward to measure the novel of the joint state and encourages agents to actively explore the action and state spaces in the environment. We compare PQmix with strong baselines on the MARL benchmark to validate it. The result of experiments demonstrates that PQmix outperforms the state-of-the-art algorithms on the StarCraft Multi-Agent Challenge (SMAC). In the end, the stability of the method is verified by experiments.
引用
收藏
页码:358 / 372
页数:15
相关论文
共 50 条
  • [1] Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
    Liu, Iou-Jen
    Jain, Unnat
    Yeh, Raymond A.
    Schwing, Alexander G.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [2] Curiosity-driven Exploration for Cooperative Multi-Agent Reinforcement Learning
    Xu, Fanchao
    Kaneko, Tomoyuki
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [3] Diverse Effective Relationship Exploration for Cooperative Multi-Agent Reinforcement Learning
    Jiang, Hao
    Liu, Yuntao
    Li, Shengze
    Zhang, Jieyuan
    Xu, Xinhai
    Liu, Donghong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 842 - 851
  • [4] Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
    Zhao, Xutong
    Pan, Yangchen
    Xiao, Chenjun
    Chandar, Sarath
    Rajendran, Janarthanan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2529 - 2540
  • [5] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [6] Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction
    童亮
    陆际联
    Journal of Beijing Institute of Technology, 2006, (02) : 133 - 137
  • [7] Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning
    Zhang, Junkai
    Zhang, Yifan
    Zhang, Xi Sheryl
    Zang, Yifan
    Cheng, Jian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17600 - 17608
  • [8] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning
    Carion, Nicolas
    Synnaeve, Gabriel
    Lazaric, Alessandro
    Usunier, Nicolas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [10] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68