Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning

Cited by: 0
Authors
Zhang, Yanqiang [1]
Feng, Dawei [1]
Ding, Bo [1]
Affiliations
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China
Keywords
Multi-agent Systems; Reinforcement Learning; Intrinsic Reward
DOI
10.1007/978-981-99-8082-6_28
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-agent reinforcement learning methods have shown significant progress; however, they continue to exhibit exploration problems in complex and challenging environments. Although current research has introduced several exploration-enhanced methods for multi-agent reinforcement learning to address this issue, these methods still suffer from inefficient exploration and low performance in challenging tasks that necessitate complex cooperation among agents. This paper proposes the prediction-action Qmix (PQmix) method, an action prediction-based approach to constructing multi-agent intrinsic rewards. PQmix uses the agents' joint local observation and the next joint local observation after executing actions to predict the real joint action of the agents. The method uses the action prediction error as the intrinsic reward to measure the novelty of the joint state, encouraging agents to actively explore the action and state spaces of the environment. We compare PQmix with strong baselines on the MARL benchmark to validate it. The experimental results demonstrate that PQmix outperforms state-of-the-art algorithms on the StarCraft Multi-Agent Challenge (SMAC). Finally, the stability of the method is verified experimentally.
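The intrinsic reward described in the abstract can be illustrated with a minimal sketch. The abstract does not give the predictor architecture, the error measure, or how the intrinsic reward is combined with the environment reward, so everything below is an assumption made for illustration: the toy linear predictor `predict_logits`, the per-agent cross-entropy used as the prediction error, and the scaling coefficient `beta` are all hypothetical and are not taken from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict_logits(joint_obs, next_joint_obs, weights):
    """Hypothetical toy predictor: a linear map from the concatenated
    (joint observation, next joint observation) to per-agent action logits.
    weights[agent][action] is a weight vector over the concatenated features."""
    features = list(joint_obs) + list(next_joint_obs)
    return [
        [sum(w * f for w, f in zip(action_weights, features))
         for action_weights in agent_weights]
        for agent_weights in weights
    ]

def action_prediction_error(per_agent_logits, joint_action):
    """Assumed error measure: sum over agents of the cross-entropy between
    the predicted action distribution and the action actually taken."""
    error = 0.0
    for logits, action in zip(per_agent_logits, joint_action):
        probs = softmax(logits)
        error += -math.log(probs[action])
    return error

def shaped_reward(extrinsic, per_agent_logits, joint_action, beta=0.1):
    """Assumed combination: environment reward plus beta times the
    action prediction error used as the intrinsic reward."""
    return extrinsic + beta * action_prediction_error(per_agent_logits, joint_action)

# Usage: two agents with three actions each, and a predictor with all-zero
# weights, which is maximally uncertain about every transition.
weights = [[[0.0] * 4 for _ in range(3)] for _ in range(2)]
logits = predict_logits([0.1, 0.2], [0.3, 0.4], weights)
bonus = action_prediction_error(logits, joint_action=[1, 2])
```

Under these assumptions, a transition whose joint action the predictor cannot yet recover yields a large error and therefore a large exploration bonus, while a transition it predicts well yields a bonus close to zero.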
Pages: 358 - 372
Number of pages: 15
Related papers
50 in total
  • [21] Cooperative Multi-agent Reinforcement Learning for Inventory Management
    Khirwar, Madhav
    Gurumoorthy, Karthik S.
    Jain, Ankit Ajit
    Manchenahally, Shantala
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 619 - 634
  • [22] The Cooperative Reinforcement Learning in a Multi-Agent Design System
    Liu, Hong
    Wang, Jihua
    PROCEEDINGS OF THE 2013 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2013, : 139 - 144
  • [23] Cooperative Multi-Agent Reinforcement Learning in Express System
    Li, Yexin
    Zheng, Yu
    Yang, Qiang
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 805 - 814
  • [24] A review of cooperative multi-agent deep reinforcement learning
    Afshin Oroojlooy
    Davood Hajinezhad
    Applied Intelligence, 2023, 53 : 13677 - 13722
  • [25] Levels of Realism for Cooperative Multi-agent Reinforcement Learning
    Cunningham, Bryan
    Cao, Yong
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT I, 2012, 7331 : 573 - 582
  • [26] Centralized reinforcement learning for multi-agent cooperative environments
    Chengxuan Lu
    Qihao Bao
    Shaojie Xia
    Chongxiao Qu
    Evolutionary Intelligence, 2024, 17 : 267 - 273
  • [27] Centralized reinforcement learning for multi-agent cooperative environments
    Lu, Chengxuan
    Bao, Qihao
    Xia, Shaojie
    Qu, Chongxiao
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
  • [28] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [29] Reinforcement learning of coordination in cooperative multi-agent systems
    Kapetanakis, S
    Kudenko, D
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
  • [30] Training Cooperative Agents for Multi-Agent Reinforcement Learning
    Bhalla, Sushrut
    Subramanian, Sriram G.
    Crowley, Mark
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1826 - 1828