Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm

Cited by: 42
Authors
Ashraf, Nesma M. [1]
Mostafa, Reham R. [2]
Sakr, Rasha H. [1]
Rashad, M. Z. [1]
Affiliations
[1] Mansoura Univ, Fac Comp & Informat Sci, Comp Sci Dept, Mansoura, Egypt
[2] Mansoura Univ, Fac Comp & Informat Sci, Informat Syst Dept, Mansoura, Egypt
Source
PLOS ONE | 2021 / Vol. 16 / No. 06
Keywords
LEVEL; GAME; GO;
DOI
10.1371/journal.pone.0252754
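Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Subject classification code
07; 0710; 09;
Abstract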
Deep Reinforcement Learning (DRL) enables agents to make decisions based on a well-designed reward function that suits a particular environment, without any prior knowledge of that environment. The adaptation of hyperparameters has a great impact on the overall learning process and on learning time. Hyperparameters should be accurately estimated while training DRL algorithms, which is one of the key challenges that we attempt to address. This paper employs a swarm-based optimization algorithm, namely the Whale Optimization Algorithm (WOA), to optimize the hyperparameters of the Deep Deterministic Policy Gradient (DDPG) algorithm and achieve the optimum control strategy in an autonomous driving control problem. DDPG is capable of handling complex environments that contain continuous action spaces. To evaluate the proposed algorithm, The Open Racing Car Simulator (TORCS), a realistic autonomous driving simulation environment, was chosen for its ease of design and implementation. Using TORCS, the DDPG agent with optimized hyperparameters was compared with a DDPG agent with reference hyperparameters. The experimental results showed that optimizing DDPG's hyperparameters maximizes the total rewards across testing episodes while maintaining a stable driving policy.
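The abstract does not give implementation details, so the following is only a minimal sketch of how a WOA loop could search a hyperparameter space. It assumes the commonly described WOA update rules (encircling the best solution, exploring around a random whale, and a spiral move with constant b = 1) and, as a simplification, updates the best solution as soon as a better one is found. The objective `toy_negative_reward`, the three hyperparameters, and their bounds are hypothetical stand-ins; in practice the objective would train a DDPG agent and return its negative total reward.

```python
import math
import random


def woa_minimize(objective, bounds, n_whales=20, n_iters=100, seed=0):
    """Minimal Whale Optimization Algorithm sketch (minimization)."""
    rng = random.Random(seed)

    def clip(x):
        # Keep every coordinate inside its (low, high) bound.
        return [min(max(v, lo), hi) for v, (lo, hi) in zip(x, bounds)]

    # Random initial population and its best member.
    whales = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_whales)]
    scores = [objective(w) for w in whales]
    best_idx = min(range(n_whales), key=scores.__getitem__)
    best, best_score = whales[best_idx][:], scores[best_idx]

    for t in range(n_iters):
        a = 2.0 * (1.0 - t / n_iters)  # decreases linearly from 2 toward 0
        for i in range(n_whales):
            x = whales[i]
            A = 2.0 * a * rng.random() - a
            C = 2.0 * rng.random()
            if rng.random() < 0.5:
                # |A| < 1: encircle the best whale; otherwise explore
                # around a randomly chosen whale.
                ref = best if abs(A) < 1.0 else whales[rng.randrange(n_whales)]
                new = [r - A * abs(C * r - xi) for r, xi in zip(ref, x)]
            else:
                # Spiral move toward the best whale (shape constant b = 1).
                l = rng.uniform(-1.0, 1.0)
                spiral = math.exp(l) * math.cos(2.0 * math.pi * l)
                new = [abs(bj - xi) * spiral + bj for bj, xi in zip(best, x)]
            whales[i] = clip(new)
            scores[i] = objective(whales[i])
            if scores[i] < best_score:
                best, best_score = whales[i][:], scores[i]
    return best, best_score


def toy_negative_reward(h):
    # Hypothetical stand-in for "train DDPG with h, return -total reward".
    log_actor_lr, log_critic_lr, gamma = h
    return ((log_actor_lr + 4.0) ** 2
            + (log_critic_lr + 3.0) ** 2
            + 100.0 * (gamma - 0.99) ** 2)


# Assumed search ranges: log10 learning rates and a discount factor.
bounds = [(-6.0, -2.0), (-5.0, -1.0), (0.90, 0.999)]
best, score = woa_minimize(toy_negative_reward, bounds)
```

Searching learning rates on a log scale is a design choice of this sketch, not something stated in the abstract. Because each real evaluation would be a full training run, the population size and iteration count would likely need to be far smaller than the defaults used here.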
Pages: 24
Related papers
50 in total
  • [31] A Deep Q-Network Reinforcement Learning-Based Model for Autonomous Driving
    Ahmed, Marwa
    Lim, Chee Peng
    Nahavandi, Saeid
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 739 - 744
  • [32] Deep Reinforcement Learning Based on the Hindsight Experience Replay for Autonomous Driving of Mobile Robot
    Park M.
    Hong J.S.
    Kwon N.K.
    Journal of Institute of Control, Robotics and Systems, 2022, 28 (11): : 1006 - 1012
  • [33] Lateral Motion Control for Obstacle Avoidance in Autonomous Driving Based on Deep Reinforcement Learning
    Liao, Yaping
    Yu, Guizhen
    Chen, Peng
    Zhou, Bin
    Li, Han
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 5229 - 5234
  • [34] Path Optimization for Autonomous Driving using Deep Learning
    Schitz, Dmitrij
    Aschemann, Harald
    IFAC PAPERSONLINE, 2022, 55 (27): : 490 - 496
  • [35] Explainable AI-based Federated Deep Reinforcement Learning for Trusted Autonomous Driving
    Rjoub, Gaith
    Bentahar, Jamal
    Wahab, Omar Abdel
    2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 318 - 323
  • [36] Soft collision avoidance based car following algorithm for autonomous driving with reinforcement learning
    Zheng, Yuqi
    Yan, Ruidong
    Jia, Bin
    Jiang, Rui
    Zheng, Shiteng
    Physica A: Statistical Mechanics and its Applications, 2024, 654
  • [37] Video Representation Learning for Decoupled Deep Reinforcement Learning Applied to Autonomous Driving
    Mohammed, Shawan Taha
    Kastouri, Mohamed
    Niederfahrenhorst, Artur
    Ascheid, Gerd
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [38] Deep reinforcement learning and robust SLAM based robotic control algorithm for self-driving path optimization
    Khan, Samiullah
    Niaz, Ashfaq
    Yinke, Dou
    Shoukat, Muhammad Usman
    Nawaz, Saqib Ali
    Frontiers in Neurorobotics, 2024, 18
  • [39] A Deep Reinforcement Learning Based Motion Cueing Algorithm for Vehicle Driving Simulation
    Scheidel, Hendrik
    Asadi, Houshyar
    Bellmann, Tobias
    Seefried, Andreas
    Mohamed, Shady
    Nahavandi, Saeid
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 9696 - 9705
  • [40] Autonomous driving policy learning based on deep reinforcement learning and multi-type sensor data
    Yang S.
    Jiang Y.-D.
    Wu J.
    Liu H.-Z.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2019, 49 (04): : 1026 - 1033