Benchmarking Deep and Non-deep Reinforcement Learning Algorithms for Discrete Environments

被引：0

作者：

Duarte, Fernando F. ^{[1
]}

Lau, Nuno ^{[1
]}

Pereira, Artur ^{[1
]}

Reis, Luis P. ^{[2
]}

机构：

[1] Univ Aveiro, IEETA, Aveiro, Portugal

[2] Univ Porto, LIACC, Porto, Portugal

来源：

FOURTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, ROBOT 2019, VOL 2 | 2020年 / 1093卷

关键词：

Reinforcement Learning; Planning; Deep Q-Network; Q-Learning; Value Iteration; Neural Fitted Q-Iteration; Policy gradient optimization;

D O I：

10.1007/978-3-030-36150-1_22

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Given the plethora of Reinforcement Learning algorithms available in the literature, it can prove challenging to decide on the most appropriate one to use in order to solve a given Reinforcement Learning task. This work presents a benchmark study on the performance of several Reinforcement Learning algorithms for discrete learning environments. The study includes several deep as well as non-deep learning algorithms, with special focus on the Deep Q-Network algorithm and its variants. Neural Fitted Q-Iteration, the predecessor of Deep Q-Network as well as Vanilla Policy Gradient and a planner were also included in this assessment in order to provide a wider range of comparison between different approaches and paradigms. Three learning environments were used in order to carry out the tests, including a 2D maze and two OpenAI Gym environments, namely a custom-built Foraging/Tagging environment and the CartPole environment.

引用

页码：263 / 275

页数：13

共 50 条

[1] Non-Deep Active Learning for Deep Neural Networks
Kawano, Yasufumi
Nota, Yoshiki
Mochizuki, Rinpei
Aoki, Yoshimitsu
[J]. SENSORS, 2022, 22 (14)
[2] Non-deep Networks
Goyal, Ankit
Bochkovskiy, Alexey
Deng, Jia
Koltun, Vladlen
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[3] Benchmarking Deep Reinforcement Learning for Continuous Control
Duan, Yan
Chen, Xi
Houthooft, Rein
Schulman, John
Abbeel, Pieter
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[4] Adaptive deep reinforcement learning for non-stationary environments
Jin ZHU
Yutong WEI
Yu KANG
Xiaofeng JIANG
Geir E.DULLERUD
[J]. Science China(Information Sciences), 2022, (10) : 225 - 241
[5] Adaptive deep reinforcement learning for non-stationary environments
Zhu, Jin
Wei, Yutong
Kang, Yu
Jiang, Xiaofeng
Dullerud, Geir E.
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
[6] Adaptive deep reinforcement learning for non-stationary environments
Jin Zhu
Yutong Wei
Yu Kang
Xiaofeng Jiang
Geir E. Dullerud
[J]. Science China Information Sciences, 2022, 65
[7] Benchmarking Off-Policy Deep Reinforcement Learning Algorithms for UAV Path Planning
Garg, Shaswat
Masnavi, Houman
Fidan, Baris
Janabi-Sharifi, Farrokh
Mantegh, Iraj
[J]. 2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 317 - 323
[8] Emphatic Algorithms for Deep Reinforcement Learning
Jiang, Ray
Zahavy, Tom
Xu, Zhongwen
White, Adam
Hessel, Matteo
Blundell, Charles
van Hasselt, Hado
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[9] A Review of Optimization Method in Face Recognition: Comparison Deep Learning and Non-Deep Learning Methods
Setiowati, Sulis
Zulfanahri
Franita, Eka Legya
Ardiyanto, Igi
[J]. 2017 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2017,
[10] A Comparative Study of non-deep Learning, Deep Learning, and Ensemble Learning Methods for Sunspot Number Prediction
Dang, Yuchen
Chen, Ziqi
Li, Heng
Shu, Hai
[J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)

← 1 2 3 4 5 →