Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods

被引：0

作者：

Wang Qisheng ^{[1
]}

Wang Qichao ^{[1
]}

Li Xiao ^{[1
]}

机构：

[1] Southeast Univ, Sch Informat Engn, Nanjing 210096, Peoples R China

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Exploration efficiency challenges for multi-agent reinforcement learning (MARL), as the policy learned by confederate MARL depends on the interaction among agents. Less informative reward also restricts the learning speed of MARL in comparison with the informative label in supervised learning. This paper proposes a novel communication method which helps agents focus on different exploration subarea to guide MARL to accelerate exploration. We propose a predictive network to forecast the reward of current state-action pair and use the guidance learned by the predictive network to modify the reward function. An improved prioritized experience replay is employed to help agents better take advantage of the different knowledge learned by different agents. Experimental results demonstrate that the proposed algorithm outperforms existing methods in cooperative multi-agent environments.

引用

页码：13949 / 13950

页数：2

共 50 条

[1] Multi-agent Exploration with Reinforcement Learning
Sygkounas, Alkis
Tsipianitis, Dimitris
Nikolakopoulos, George
Bechlioulis, Charalampos P.
[J]. 2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
[2] Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Liu, Iou-Jen
Jain, Unnat
Yeh, Raymond A.
Schwing, Alexander G.
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[3] Adaptive Average Exploration in Multi-Agent Reinforcement Learning
Hall, Garrett
Holladay, Ken
[J]. 2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
[4] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning
Graham, Caoimhin
Bell, David
Luo, Zhihui
[J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298
[5] LMRL: A multi-agent reinforcement learning model and algorithm
Wang, BN
Gao, Y
Chen, ZQ
Xie, JY
Chen, SF
[J]. Third International Conference on Information Technology and Applications, Vol 1, Proceedings, 2005, : 303 - 307
[6] A new accelerating algorithm for multi-agent reinforcement learning
张汝波
仲宇
顾国昌
[J]. Journal of Harbin Institute of Technology(New series), 2005, (01) : 48 - 51
[7] Sequence to Sequence Multi-agent Reinforcement Learning Algorithm
Shi, Tengfei
Wang, Li
Huang, Zirong
[J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 206 - 213
[8] Strangeness-driven exploration in multi-agent reinforcement learning
Kim, Ju-Bong
Choi, Ho-Bin
Han, Youn-Hee
[J]. NEURAL NETWORKS, 2024, 172
[9] Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
Zhang, Yanqiang
Feng, Dawei
Ding, Bo
[J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 358 - 372
[10] UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Gupta, Tarun
Mahajan, Anuj
Peng, Bei
Bohmer, Wendelin
Whiteson, Shimon
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139

← 1 2 3 4 5 →