LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle

Cited by: 4
Authors
Xia, Jiawei [1 ,2 ]
Zhu, Xufang [3 ]
Liu, Zhong [1 ]
Xia, Qingtao [1 ]
Affiliations
[1] Naval Univ Engn, Sch Weaponry Engn, Wuhan 430033, Peoples R China
[2] Naval Aviat Univ, Qingdao Campus, Qingdao 266041, Peoples R China
[3] Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
unmanned surface vehicle (USV); deep reinforcement learning (DRL); path following; path dataset; proximal policy optimization; long short-term memory (LSTM); LINE TRACKING; ALGORITHMS; SPEED; LEVEL;
DOI
10.23919/JSEE.2023.000113
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
To solve the path following control problem for unmanned surface vehicles (USVs), a control method based on deep reinforcement learning (DRL) with long short-term memory (LSTM) networks is proposed. A distributed proximal policy optimization (DPPO) algorithm, a modified actor-critic reinforcement learning algorithm, is adapted to improve controller performance over repeated trials. The LSTM network structure is introduced to handle the strong temporal correlation in the USV control problem. In addition, a specially designed path dataset, including straight and curved paths, is established to simulate various sailing scenarios so that the reinforcement learning controller can obtain as much handling experience as possible. Extensive numerical simulation results demonstrate that the proposed method achieves better control performance in missions involving complex maneuvers than a controller trained with limited scenarios, and that it can potentially be applied in practice.
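For readers unfamiliar with proximal policy optimization, the sketch below shows the standard clipped surrogate objective that PPO-family algorithms maximize. It is not taken from the paper: this record does not give the authors' loss, network, or hyperparameters, so the function name `ppo_clip_objective`, its arguments, and the default `clip_eps=0.2` are illustrative assumptions.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current and the data-collecting policy. All names are hypothetical.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Pessimistic choice: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# With identical policies the ratio is 1, so the objective equals the mean advantage.
value = ppo_clip_objective([0.0, 0.0], [0.0, 0.0], [1.0, -1.0])
```

The clipping removes the incentive to move the policy far from the one that collected the data in a single update, which is what lets distributed workers reuse the same batch for several gradient steps.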
Pages: 1343-1358
Page count: 16
Related papers
50 in total
  • [31] Path Following Control for Unmanned Surface Vehicles: A Reinforcement Learning-Based Method With Experimental Validation
    Wang, Yuanda
    Cao, Jingyu
    Sun, Jia
    Zou, Xuesong
    Sun, Changyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 12 (18237-18250) : 1 - 14
  • [32] Data-driven unmanned surface vessel path following control method based on reinforcement learning
    Deng, Weinan
    Li, Hao
    Wen, YuanQiao
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 3035 - 3040
  • [33] An Autonomous Path Planning Model for Unmanned Ships Based on Deep Reinforcement Learning
    Guo, Siyu
    Zhang, Xiuguo
    Zheng, Yisong
    Du, Yiquan
    SENSORS, 2020, 20 (02)
  • [34] A novel path planning approach for unmanned ships based on deep reinforcement learning
    Chen, Chen
    Ma, Feng
    Liu, Jia-Lun
    Yan, Xin-Ping
    Chen, Xian-Qiao
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 626 - 633
  • [35] Disturbance learning controller design for unmanned surface vehicle using LSTM technique of recurrent neural network
    Jeong, Sang-Ki
    Ji, Dea-Hyeong
    Oh, Ji-Youn
    Seo, Jung-Min
    Choi, Hyeung-Sik
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 8001 - 8011
  • [36] Efficient Path Following Algorithm for Unmanned Surface Vehicle
    Niu, Hanlin
    Lu, Yu
    Savvaris, Al
    Tsourdos, Antonios
    OCEANS 2016 - SHANGHAI, 2016,
  • [37] Vehicle-Following Control Based on Deep Reinforcement Learning
    Huang, Yong
    Xu, Xin
    Li, Yong
    Zhang, Xinglong
    Liu, Yao
    Zhang, Xiaochuan
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [38] Autonomous Vehicular Landings on the Deck of an Unmanned Surface Vehicle using Deep Reinforcement Learning
    Polvara, Riccardo
    Sharma, Sanjay
    Wan, Jian
    Manning, Andrew
    Sutton, Robert
    ROBOTICA, 2019, 37 (11) : 1867 - 1882
  • [39] Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments
    Xie, Ronglei
    Meng, Zhijun
    Wang, Lifeng
    Li, Haochen
    Wang, Kaipeng
    Wu, Zhe
    IEEE Access, 2021, 9 : 24884 - 24900