LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle

Cited by: 4

Authors:

Xia, Jiawei [1,2]

Zhu, Xufang [3]

Liu, Zhong [1]

Xia, Qingtao [1]

Affiliations:

[1] Naval Univ Engn, Sch Weaponry Engn, Wuhan 430033, Peoples R China

[2] Naval Aviat Univ, Qingdao Campus, Qingdao 266041, Peoples R China

[3] Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China

Source:

JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS | 2023 / Vol. 34 / No. 05

Funding:

China Postdoctoral Science Foundation;

Keywords:

unmanned surface vehicle (USV); deep reinforcement learning (DRL); path following; path dataset; proximal policy optimization; long short-term memory (LSTM); LINE TRACKING; ALGORITHMS; SPEED; LEVEL;

DOI:

10.23919/JSEE.2023.000113

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

To solve the path following control problem for unmanned surface vehicles (USVs), a control method based on deep reinforcement learning (DRL) with long short-term memory (LSTM) networks is proposed. A distributed proximal policy optimization (DPPO) algorithm, which is a modified actor-critic-based type of reinforcement learning algorithm, is adapted to improve the controller performance in repeated trials. The LSTM network structure is introduced to handle the strong temporal correlation in the USV control problem. In addition, a specially designed path dataset, including straight and curved paths, is established to simulate various sailing scenarios so that the reinforcement learning controller can obtain as much handling experience as possible. Extensive numerical simulation results demonstrate that the proposed method has better control performance under missions involving complex maneuvers than a controller trained with limited scenarios, and that it can potentially be applied in practice.
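
The abstract names proximal policy optimization but gives no formulas. As a rough illustration only, the following sketch computes the standard clipped surrogate objective that PPO-type algorithms maximize. It is not the authors' implementation: the function name `ppo_clip_objective`, the default `epsilon=0.2`, and the sample numbers are assumptions made for this example, and the distributed workers and LSTM networks described in the paper are left out.

```python
def ppo_clip_objective(ratios, advantages, epsilon=0.2):
    """Mean clipped surrogate objective of PPO (a quantity to maximize).

    ratios     -- probability ratios pi_new(a|s) / pi_old(a|s), one per sample
    advantages -- advantage estimates for the same samples
    epsilon    -- clipping range (0.2 is an assumed, commonly used value)
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("at least one sample is required")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the ratio to [1 - epsilon, 1 + epsilon].
        clipped = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped * advantage)
    return total / len(ratios)


if __name__ == "__main__":
    # Hypothetical samples: one over-weighted good action, one under-weighted bad one.
    print(ppo_clip_objective([1.5, 0.5], [1.0, -1.0]))  # roughly 0.2
```

Taking the minimum of the two terms removes any incentive to move the new policy far from the old one within a single update, which is the property that makes repeated trials stable in PPO-style training.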


Pages: 1343-1358

Page count: 16

50 items in total

[31] Path Following Control for Unmanned Surface Vehicles: A Reinforcement Learning-Based Method With Experimental Validation
Wang, Yuanda
Cao, Jingyu
Sun, Jia
Zou, Xuesong
Sun, Changyin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 12 (18237-18250) : 1 - 14
[32] Data-driven unmanned surface vessel path following control method based on reinforcement learning
Deng, Weinan
Li, Hao
Wen, YuanQiao
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3035 - 3040
[33] An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
Guo, Siyu
Zhang, Xiuguo
Zheng, Yisong
Du, Yiquan
SENSORS, 2020, 20 (02)
[34] A novel path planning approach for unmanned ships based on deep reinforcement learning
Chen, Chen
Ma, Feng
Liu, Jia-Lun
Yan, Xin-Ping
Chen, Xian-Qiao
DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
[35] Disturbance learning controller design for unmanned surface vehicle using LSTM technique of recurrent neural network
Jeong, Sang-Ki
Ji, Dea-Hyeong
Oh, Ji-Youn
Seo, Jung-Min
Choi, Hyeung-Sik
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 8001 - 8011
[36] Efficient Path Following Algorithm for Unmanned Surface Vehicle
Niu, Hanlin
Lu, Yu
Savvaris, Al
Tsourdos, Antonios
OCEANS 2016 - SHANGHAI, 2016
[37] Vehicle-Following Control Based on Deep Reinforcement Learning
Huang, Yong
Xu, Xin
Li, Yong
Zhang, Xinglong
Liu, Yao
Zhang, Xiaochuan
APPLIED SCIENCES-BASEL, 2022, 12 (20)
[38] Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning
Polvara, Riccardo
Sharma, Sanjay
Wan, Jian
Manning, Andrew
Sutton, Robert
ROBOTICA, 2019, 37 (11) : 1867 - 1882
[39] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
Xie, Ronglei
Meng, Zhijun
Wang, Lifeng
Li, Haochen
Wang, Kaipeng
Wu, Zhe
IEEE Access, 2021, 9 : 24884 - 24900
