LSTM-DPPO based deep reinforcement learning controller for path following optimization of unmanned surface vehicle

被引:4
|
作者
Xia, Jiawei [1 ,2 ]
Zhu, Xufang [3 ]
Liu, Zhong [1 ]
Xia, Qingtao [1 ]
机构
[1] Naval Univ Engn, Sch Weaponry Engn, Wuhan 430033, Peoples R China
[2] Naval Aviat Univ, Qingdao Campus, Qingdao 266041, Peoples R China
[3] Naval Univ Engn, Sch Elect Engn, Wuhan 430033, Peoples R China
基金
中国博士后科学基金;
关键词
unmanned surface vehicle (USV); deep reinforcement learning (DRL); path following; path dataset; proximal policy optimization; long short-term memory (LSTM); LINE TRACKING; ALGORITHMS; SPEED; LEVEL;
D O I
10.23919/JSEE.2023.000113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To solve the path following control problem for unmanned surface vehicles (USVs), a control method based on deep reinforcement learning (DRL) with long short-term memory (LSTM) networks is proposed. A distributed proximal policy optimization (DPPO) algorithm, which is a modified actorcritic-based type of reinforcement learning algorithm, is adapted to improve the controller performance in repeated trials. The LSTM network structure is introduced to solve the strong temporal correlation USV control problem. In addition, a specially designed path dataset, including straight and curved paths, is established to simulate various sailing scenarios so that the reinforcement learning controller can obtain as much handling experience as possible. Extensive numerical simulation results demonstrate that the proposed method has better control performance under missions involving complex maneuvers than trained with limited scenarios and can potentially be applied in practice.
引用
收藏
页码:1343 / 1358
页数:16
相关论文
共 50 条
  • [41] Deep Merging: Vehicle Merging Controller Based on Deep Reinforcement Learning with Embedding Network
    Nishitani, Ippei
    Yang, Hao
    Guo, Rui
    Keshavamurthy, Shalini
    Oguchi, Kentaro
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 216 - 221
  • [42] Path Following with Supervised Deep Reinforcement Learning
    Gu, Wen-Yi
    Xu, Xin
    Yang, Jian
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 448 - 452
  • [43] Research on Global Path Planning of Unmanned Surface Vehicle Based on Environmental Optimization
    Xu, Pengfei
    Ding, Yanxu
    Cao, Qingbo
    Ship Building of China, 2022, 63 (05): : 206 - 220
  • [44] A Nonlinear Path Following Controller for an Underactuated Unmanned Surface Vessel
    Daly, John M.
    Tribou, Michael J.
    Waslander, Steven L.
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 82 - 87
  • [45] Multi-sensor based strategy learning with deep reinforcement learning for unmanned ground vehicle
    Luo M.
    International Journal of Intelligent Networks, 2023, 4 : 325 - 336
  • [46] Learn to Navigate: Cooperative Path Planning for Unmanned Surface Vehicles Using Deep Reinforcement Learning
    Zhou, Xinyuan
    Wu, Peng
    Zhang, Haifeng
    Guo, Weihong
    Liu, Yuanchang
    IEEE ACCESS, 2019, 7 : 165262 - 165278
  • [47] Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning
    Guo, Hongda
    Xu, Youchun
    Ma, Yulin
    Xu, Shucai
    Li, Zhixiong
    ELECTRONICS, 2023, 12 (23)
  • [48] Reinforcement learning-based fuzzy controller for autonomous guided vehicle path tracking
    Kuo, Ping-Huan
    Chen, Sing-Yan
    Feng, Po-Hsun
    Chang, Chen-Wen
    Huang, Chiou-Jye
    Peng, Chao-Chung
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [49] Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
    Wang, Shuwu
    Ma, Feng
    Yan, Xinping
    Wu, Peng
    Liu, Yuanchang
    APPLIED OCEAN RESEARCH, 2021, 110
  • [50] Distributed Unmanned Aerial Vehicle Cluster Testing Method Based on Deep Reinforcement Learning
    Li, Dong
    Yang, Panfei
    APPLIED SCIENCES-BASEL, 2024, 14 (23):