Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods

被引:0
|
作者
Wang Qisheng [1 ]
Wang Qichao [1 ]
Li Xiao [1 ]
机构
[1] Southeast Univ, Sch Informat Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploration efficiency challenges for multi-agent reinforcement learning (MARL), as the policy learned by confederate MARL depends on the interaction among agents. Less informative reward also restricts the learning speed of MARL in comparison with the informative label in supervised learning. This paper proposes a novel communication method which helps agents focus on different exploration subarea to guide MARL to accelerate exploration. We propose a predictive network to forecast the reward of current state-action pair and use the guidance learned by the predictive network to modify the reward function. An improved prioritized experience replay is employed to help agents better take advantage of the different knowledge learned by different agents. Experimental results demonstrate that the proposed algorithm outperforms existing methods in cooperative multi-agent environments.
引用
收藏
页码:13949 / 13950
页数:2
相关论文
共 50 条
  • [1] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    [J]. 2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [2] Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
    Liu, Iou-Jen
    Jain, Unnat
    Yeh, Raymond A.
    Schwing, Alexander G.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Adaptive Average Exploration in Multi-Agent Reinforcement Learning
    Hall, Garrett
    Holladay, Ken
    [J]. 2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
  • [4] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
    Graham, Caoimhin
    Bell, David
    Luo, Zhihui
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
  • [5] LMRL: A multi-agent reinforcement learning model and algorithm
    Wang, BN
    Gao, Y
    Chen, ZQ
    Xie, JY
    Chen, SF
    [J]. Third International Conference on Information Technology and Applications, Vol 1, Proceedings, 2005, : 303 - 307
  • [6] A new accelerating algorithm for multi-agent reinforcement learning
    张汝波
    仲宇
    顾国昌
    [J]. Journal of Harbin Institute of Technology(New series), 2005, (01) : 48 - 51
  • [7] Sequence to Sequence Multi-agent Reinforcement Learning Algorithm
    Shi, Tengfei
    Wang, Li
    Huang, Zirong
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 206 - 213
  • [8] Strangeness-driven exploration in multi-agent reinforcement learning
    Kim, Ju-Bong
    Choi, Ho-Bin
    Han, Youn-Hee
    [J]. NEURAL NETWORKS, 2024, 172
  • [9] Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
    Zhang, Yanqiang
    Feng, Dawei
    Ding, Bo
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 358 - 372
  • [10] UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
    Gupta, Tarun
    Mahajan, Anuj
    Peng, Bei
    Bohmer, Wendelin
    Whiteson, Shimon
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139