LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle

被引：4

作者：

Xia, Jiawei ^{[1
,2
]}

Zhu, Xufang ^{[3
]}

Liu, Zhong ^{[1
]}

Xia, Qingtao ^{[1
]}

机构：

[1] Naval Univ Engn, Sch Weaponry Engn, Wuhan 430033, Peoples R China

[2] Naval Aviat Univ, Qingdao Campus, Qingdao 266041, Peoples R China

[3] Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China

来源：

JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS | 2023年 / 34卷 / 05期

基金：

中国博士后科学基金;

关键词：

unmanned surface vehicle (USV); deep reinforcement learning (DRL); path following; path dataset; proximal policy optimization; long short-term memory (LSTM); LINE TRACKING; ALGORITHMS; SPEED; LEVEL;

D O I：

10.23919/JSEE.2023.000113

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To solve the path following control problem for unmanned surface vehicles (USVs), a control method based on deep reinforcement learning (DRL) with long short-term memory (LSTM) networks is proposed. A distributed proximal policy optimization (DPPO) algorithm, which is a modified actorcritic-based type of reinforcement learning algorithm, is adapted to improve the controller performance in repeated trials. The LSTM network structure is introduced to solve the strong temporal correlation USV control problem. In addition, a specially designed path dataset, including straight and curved paths, is established to simulate various sailing scenarios so that the reinforcement learning controller can obtain as much handling experience as possible. Extensive numerical simulation results demonstrate that the proposed method has better control performance under missions involving complex maneuvers than trained with limited scenarios and can potentially be applied in practice.

引用

页码：1343 / 1358

页数：16

共 50 条

[41] Deep Merging: Vehicle Merging Controller Based on Deep Reinforcement Learning with Embedding Network
Nishitani, Ippei
Yang, Hao
Guo, Rui
Keshavamurthy, Shalini
Oguchi, Kentaro
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 216 - 221
[42] Path Following with Supervised Deep Reinforcement Learning
Gu, Wen-Yi
Xu, Xin
Yang, Jian
PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 448 - 452
[43] Research on Global Path Planning of Unmanned Surface Vehicle Based on Environmental Optimization
Xu, Pengfei
Ding, Yanxu
Cao, Qingbo
Ship Building of China, 2022, 63 (05): : 206 - 220
[44] A Nonlinear Path Following Controller for an Underactuated Unmanned Surface Vessel
Daly, John M.
Tribou, Michael J.
Waslander, Steven L.
2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 82 - 87
[45] Multi-sensor based strategy learning with deep reinforcement learning for unmanned ground vehicle
Luo M.
International Journal of Intelligent Networks, 2023, 4 : 325 - 336
[46] Learn to Navigate: Cooperative Path Planning for Unmanned Surface Vehicles Using Deep Reinforcement Learning
Zhou, Xinyuan
Wu, Peng
Zhang, Haifeng
Guo, Weihong
Liu, Yuanchang
IEEE ACCESS, 2019, 7 : 165262 - 165278
[47] Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning
Guo, Hongda
Xu, Youchun
Ma, Yulin
Xu, Shucai
Li, Zhixiong
ELECTRONICS, 2023, 12 (23)
[48] Reinforcement learning-based fuzzy controller for autonomous guided vehicle path tracking
Kuo, Ping-Huan
Chen, Sing-Yan
Feng, Po-Hsun
Chang, Chen-Wen
Huang, Chiou-Jye
Peng, Chao-Chung
ADVANCED ENGINEERING INFORMATICS, 2025, 65
[49] Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
Wang, Shuwu
Ma, Feng
Yan, Xinping
Wu, Peng
Liu, Yuanchang
APPLIED OCEAN RESEARCH, 2021, 110
[50] Distributed Unmanned Aerial Vehicle Cluster Testing Method Based on Deep Reinforcement Learning
Li, Dong
Yang, Panfei
APPLIED SCIENCES-BASEL, 2024, 14 (23):

← 1 2 3 4 5 →