Stability Analysis for Autonomous Vehicle Navigation Trained over Deep Deterministic Policy Gradient

被引:1
|
作者
Cabezas-Olivenza, Mireya [1 ]
Zulueta, Ekaitz [1 ]
Sanchez-Chica, Ander [1 ]
Fernandez-Gamiz, Unai [2 ]
Teso-Fz-Betono, Adrian [1 ]
机构
[1] Univ Basque Country UPV EHU, Syst Engn & Automat Control Dept, Nieves Cano 12, Vitoria 01006, Spain
[2] Univ Basque Country UPV EHU, Dept Nucl & Fluid Mech, Nieves Cano 12, Vitoria 01006, Spain
关键词
navigation; neural network; autonomous vehicle; reinforcement learning; DDPG; lyapunov; stability; q-learning; DYNAMIC WINDOW APPROACH; ROBOT;
D O I
10.3390/math11010132
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The Deep Deterministic Policy Gradient (DDPG) algorithm is a reinforcement learning algorithm that combines Q-learning with a policy. Nevertheless, this algorithm generates failures that are not well understood. Rather than looking for those errors, this study presents a way to evaluate the suitability of the results obtained. Using the purpose of autonomous vehicle navigation, the DDPG algorithm is applied, obtaining an agent capable of generating trajectories. This agent is evaluated in terms of stability through the Lyapunov function, verifying if the proposed navigation objectives are achieved. The reward function of the DDPG is used because it is unknown if the neural networks of the actor and the critic are correctly trained. Two agents are obtained, and a comparison is performed between them in terms of stability, demonstrating that the Lyapunov function can be used as an evaluation method for agents obtained by the DDPG algorithm. Verifying the stability at a fixed future horizon, it is possible to determine whether the obtained agent is valid and can be used as a vehicle controller, so a task-satisfaction assessment can be performed. Furthermore, the proposed analysis is an indication of which parts of the navigation area are insufficient in training terms.
引用
下载
收藏
页数:27
相关论文
共 50 条
  • [1] AUTONOMOUS VEHICLE DRIVING VIA DEEP DETERMINISTIC POLICY GRADIENT
    Huang, Wenhui
    Braghin, Francesco
    Arrigoni, Stefano
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 3, 2020,
  • [2] Deep Deterministic Policy Gradient for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Bottega, Jair Augusto
    de Souza Leite Cuadros, Marco Antonio
    Tello Gamarra, Daniel Fernando
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (01) : 349 - 361
  • [3] Sim-to-Real Transfer of Image-Based Autonomous Guidewire Navigation Trained by Deep Deterministic Policy Gradient with Behavior Cloning for Fast Learning
    Cho, Yongjun
    Park, Jae-Hyeon
    Choi, Jaesoon
    Chang, Dong Eui
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3468 - 3475
  • [4] Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient
    Li B.
    Yue K.-Q.
    Gan Z.-G.
    Gao P.-X.
    Yuhang Xuebao/Journal of Astronautics, 2021, 42 (06): : 757 - 765
  • [5] Deep Deterministic Policy Gradient for Navigation of Mobile Robots in Simulated Environments
    Jesus, Junior C.
    Bottega, Jair A.
    Cuadros, Marco A. S. L.
    Gamarra, Daniel F. T.
    2019 19TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2019, : 362 - 367
  • [6] Robot arm navigation using deep deterministic policy gradient algorithms
    Farag, Wael
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (05) : 617 - 627
  • [7] Research on deep deterministic policy gradient guidance method for reentry vehicle
    Guo, Dongzi
    Huang, Rong
    Xu, Hechuan
    Sun, Liwei
    Cui, Naigang
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2022, 44 (06): : 1942 - 1949
  • [8] Composite deep learning control for autonomous bicycles by using deep deterministic policy gradient
    He, Kanghui
    Dong, Chaoyang
    Yan, An
    Zheng, Qingyuan
    Liang, Bin
    Wang, Qing
    IECON 2020: THE 46TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2020, : 2766 - 2773
  • [9] Perception Enhanced Deep Deterministic Policy Gradient for Autonomous Driving in Complex Scenarios
    Liao, Lyuchao
    Xiao, Hankun
    Xing, Pengqi
    Gan, Zhenhua
    He, Youpeng
    Wang, Jiajun
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 140 (01): : 557 - 576
  • [10] Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method
    Xu, Zhao
    Lyu, Yang
    Pan, Quan
    Hu, Jinwen
    Zhao, Chunhui
    Liu, Shuai
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2018, : 306 - 311