Stability Analysis for Autonomous Vehicle Navigation Trained over Deep Deterministic Policy Gradient

Cited by: 1
Authors
Cabezas-Olivenza, Mireya [1]
Zulueta, Ekaitz [1]
Sanchez-Chica, Ander [1]
Fernandez-Gamiz, Unai [2]
Teso-Fz-Betono, Adrian [1]
Affiliations
[1] Univ Basque Country UPV EHU, Syst Engn & Automat Control Dept, Nieves Cano 12, Vitoria 01006, Spain
[2] Univ Basque Country UPV EHU, Dept Nucl & Fluid Mech, Nieves Cano 12, Vitoria 01006, Spain
Keywords
navigation; neural network; autonomous vehicle; reinforcement learning; DDPG; Lyapunov; stability; Q-learning; dynamic window approach; robot
DOI
10.3390/math11010132
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
The Deep Deterministic Policy Gradient (DDPG) algorithm is a reinforcement learning algorithm that combines Q-learning with a deterministic policy gradient. However, the algorithm produces failures that are not well understood. Rather than searching for the causes of those failures, this study presents a way to evaluate the suitability of the results obtained. DDPG is applied to autonomous vehicle navigation, yielding an agent capable of generating trajectories. The agent is evaluated for stability by means of a Lyapunov function, which verifies whether the proposed navigation objectives are achieved. The analysis uses the DDPG reward function because it is unknown whether the actor and critic neural networks are correctly trained. Two agents are obtained and compared in terms of stability, demonstrating that the Lyapunov function can serve as an evaluation method for agents obtained with DDPG. By verifying stability at a fixed future horizon, it is possible to determine whether an agent is valid and can be used as a vehicle controller, so a task-satisfaction assessment can be performed. Furthermore, the analysis indicates which parts of the navigation area are insufficiently covered in training.
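The idea of checking stability at a fixed future horizon can be illustrated with a minimal Python sketch. This is not the authors' implementation: the goal position, the distance-based reward, the candidate function V(s) = -r(s), the displacement kinematics, and the stand-in policy `toy_policy` are all assumptions made for illustration. The sketch rolls a policy forward for a fixed number of steps and reports whether the candidate Lyapunov function has decreased.

```python
import math

GOAL = (0.0, 0.0)  # hypothetical navigation target (assumption)

def reward(state):
    # Assumed reward: negative Euclidean distance to the goal.
    return -math.dist(state, GOAL)

def lyapunov(state):
    # Candidate Lyapunov function built from the assumed reward: V(s) = -r(s) >= 0.
    return -reward(state)

def toy_policy(state, gain=0.2):
    # Stand-in for a trained actor: move a fixed fraction of the way to the goal.
    return tuple(gain * (g - s) for s, g in zip(state, GOAL))

def step(state, action):
    # Hypothetical kinematics: the action is applied as a displacement.
    return tuple(s + a for s, a in zip(state, action))

def decreases_over_horizon(policy, start, horizon=5):
    # True if V is strictly lower after `horizon` steps than at the start.
    state = start
    for _ in range(horizon):
        state = step(state, policy(state))
    return lyapunov(state) < lyapunov(start)

def stability_map(policy, starts, horizon=5):
    # One verdict per start state; False entries flag regions that may need more training.
    return {s: decreases_over_horizon(policy, s, horizon) for s in starts}
```

Evaluating `stability_map` over a grid of start positions gives a per-region verdict, which mirrors the abstract's point that the analysis can indicate where training is insufficient.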
Pages: 27