Benchmarking Deep and Non-deep Reinforcement Learning Algorithms for Discrete Environments

被引:0
|
作者
Duarte, Fernando F. [1 ]
Lau, Nuno [1 ]
Pereira, Artur [1 ]
Reis, Luis P. [2 ]
机构
[1] Univ Aveiro, IEETA, Aveiro, Portugal
[2] Univ Porto, LIACC, Porto, Portugal
关键词
Reinforcement Learning; Planning; Deep Q-Network; Q-Learning; Value Iteration; Neural Fitted Q-Iteration; Policy gradient optimization;
D O I
10.1007/978-3-030-36150-1_22
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given the plethora of Reinforcement Learning algorithms available in the literature, it can prove challenging to decide on the most appropriate one to use in order to solve a given Reinforcement Learning task. This work presents a benchmark study on the performance of several Reinforcement Learning algorithms for discrete learning environments. The study includes several deep as well as non-deep learning algorithms, with special focus on the Deep Q-Network algorithm and its variants. Neural Fitted Q-Iteration, the predecessor of Deep Q-Network as well as Vanilla Policy Gradient and a planner were also included in this assessment in order to provide a wider range of comparison between different approaches and paradigms. Three learning environments were used in order to carry out the tests, including a 2D maze and two OpenAI Gym environments, namely a custom-built Foraging/Tagging environment and the CartPole environment.
引用
收藏
页码:263 / 275
页数:13
相关论文
共 50 条
  • [1] Non-Deep Active Learning for Deep Neural Networks
    Kawano, Yasufumi
    Nota, Yoshiki
    Mochizuki, Rinpei
    Aoki, Yoshimitsu
    [J]. SENSORS, 2022, 22 (14)
  • [2] Non-deep Networks
    Goyal, Ankit
    Bochkovskiy, Alexey
    Deng, Jia
    Koltun, Vladlen
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [3] Benchmarking Deep Reinforcement Learning for Continuous Control
    Duan, Yan
    Chen, Xi
    Houthooft, Rein
    Schulman, John
    Abbeel, Pieter
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [4] Adaptive deep reinforcement learning for non-stationary environments
    Jin ZHU
    Yutong WEI
    Yu KANG
    Xiaofeng JIANG
    Geir E.DULLERUD
    [J]. Science China(Information Sciences), 2022, (10) : 225 - 241
  • [5] Adaptive deep reinforcement learning for non-stationary environments
    Zhu, Jin
    Wei, Yutong
    Kang, Yu
    Jiang, Xiaofeng
    Dullerud, Geir E.
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)
  • [6] Adaptive deep reinforcement learning for non-stationary environments
    Jin Zhu
    Yutong Wei
    Yu Kang
    Xiaofeng Jiang
    Geir E. Dullerud
    [J]. Science China Information Sciences, 2022, 65
  • [7] Benchmarking Off-Policy Deep Reinforcement Learning Algorithms for UAV Path Planning
    Garg, Shaswat
    Masnavi, Houman
    Fidan, Baris
    Janabi-Sharifi, Farrokh
    Mantegh, Iraj
    [J]. 2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 317 - 323
  • [8] Emphatic Algorithms for Deep Reinforcement Learning
    Jiang, Ray
    Zahavy, Tom
    Xu, Zhongwen
    White, Adam
    Hessel, Matteo
    Blundell, Charles
    van Hasselt, Hado
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] A Review of Optimization Method in Face Recognition: Comparison Deep Learning and Non-Deep Learning Methods
    Setiowati, Sulis
    Zulfanahri
    Franita, Eka Legya
    Ardiyanto, Igi
    [J]. 2017 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2017,
  • [10] A Comparative Study of non-deep Learning, Deep Learning, and Ensemble Learning Methods for Sunspot Number Prediction
    Dang, Yuchen
    Chen, Ziqi
    Li, Heng
    Shu, Hai
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)