Stability Analysis for Autonomous Vehicle Navigation Trained over Deep Deterministic Policy Gradient

Cited by: 1

Authors:

Cabezas-Olivenza, Mireya [1]

Zulueta, Ekaitz [1]

Sanchez-Chica, Ander [1]

Fernandez-Gamiz, Unai [2]

Teso-Fz-Betono, Adrian [1]

Affiliations:

[1] Univ Basque Country UPV EHU, Syst Engn & Automat Control Dept, Nieves Cano 12, Vitoria 01006, Spain

[2] Univ Basque Country UPV EHU, Dept Nucl & Fluid Mech, Nieves Cano 12, Vitoria 01006, Spain

Source:

MATHEMATICS | 2023 / Vol. 11 / Issue 01

Keywords:

navigation; neural network; autonomous vehicle; reinforcement learning; DDPG; Lyapunov; stability; Q-learning; DYNAMIC WINDOW APPROACH; ROBOT

DOI:

10.3390/math11010132

Chinese Library Classification (CLC) number:

O1 [Mathematics]

Discipline classification codes:

0701; 070101

Abstract:

The Deep Deterministic Policy Gradient (DDPG) algorithm is a reinforcement learning algorithm that combines Q-learning with a policy. Nevertheless, this algorithm produces failures that are not well understood. Rather than searching for the source of those failures, this study presents a way to evaluate the suitability of the results obtained. The DDPG algorithm is applied to autonomous vehicle navigation, yielding an agent capable of generating trajectories. This agent is evaluated in terms of stability through the Lyapunov function, verifying whether the proposed navigation objectives are achieved. The reward function of the DDPG is used because it is unknown whether the neural networks of the actor and the critic are correctly trained. Two agents are obtained and compared in terms of stability, demonstrating that the Lyapunov function can be used as an evaluation method for agents obtained by the DDPG algorithm. By verifying stability at a fixed future horizon, it is possible to determine whether the obtained agent is valid and can be used as a vehicle controller, so a task-satisfaction assessment can be performed. Furthermore, the proposed analysis indicates which parts of the navigation area are insufficiently covered by training.

Pages: 27

