Adaptive dynamic programming and deep reinforcement learning for the control of an unmanned surface vehicle: Experimental results

被引:0
|
作者
Gonzalez-Garcia, Alejandro [1 ]
Barragan-Alcantar, David [1 ]
Collado-Gonzalez, Ivana [1 ]
Garrido, Leonardo [1 ]
机构
[1] Tecnol Monterrey, Sch Sci & Engn, Av Eugenio Garza Sada 2501 Sur, Monterrey 64849, Nuevo Leon, Mexico
关键词
Unmanned surface vehicle (USV); Adaptive dynamic programming; Backpropagation through time; Deep reinforcement learning; Marine robotics; BACKPROPAGATION; TRACKING;
D O I
10.1016/j.conengprac.2021.104807
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a low-level controller for an unmanned surface vehicle based on adaptive dynamic programming and deep reinforcement learning. This approach uses a single deep neural network capable of self-learning a policy, and driving the surge speed and yaw dynamics of a vessel. A simulation of the vehicle mathematical model was used to train the neural network with the model-based backpropagation through time algorithm, capable of dealing with continuous action-spaces. The path-following control scenario is additionally addressed by combining the proposed low-level controller and a line-of-sight based guidance law with time-varying look-ahead distance. Simulation and real-world experimental results are presented to validate the control capabilities of the proposed approach and contribute to the diversity of validated applications of adaptive dynamic programming based control strategies. Results show the controller is capable of self-learning the policy to drive the surge speed and yaw dynamics, and has an improved performance in comparison to a standard controller.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
    Wang, Shuwu
    Ma, Feng
    Yan, Xinping
    Wu, Peng
    Liu, Yuanchang
    [J]. APPLIED OCEAN RESEARCH, 2021, 110
  • [2] Research on Control of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Li, Baoan
    [J]. Ship Building of China, 2020, 61 : 14 - 20
  • [3] Speed and heading control of an unmanned surface vehicle using deep reinforcement learning
    Wu, Ting
    Ye, Hui
    Xiang, Zhengrong
    Yang, Xiaofei
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 573 - 578
  • [4] Deep reinforcement learning-based controller for dynamic positioning of an unmanned surface vehicle
    Yuan, Wei
    Rui, Xingwen
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [5] Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control
    Lewis, Frank L.
    Vrabie, Draguna
    [J]. IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2009, 9 (03) : 32 - 50
  • [6] Collision avoidance for an unmanned surface vehicle using deep reinforcement learning
    Woo, Joohyun
    Kim, Nakwan
    [J]. OCEAN ENGINEERING, 2020, 199
  • [7] Research on Path Tracking Control Method of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Guo, Rui
    Yuan, Wei
    [J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [8] Special Issue on Deep Reinforcement Learning and Adaptive Dynamic Programming
    Zhao, Dongbin
    Liu, Derong
    Lewis, F. L.
    Principe, Jose C.
    Squartini, Stefano
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2038 - 2041
  • [9] Deep hierarchical reinforcement learning based formation planning for multiple unmanned surface vehicles with experimental results
    Wei, Xiangwei
    Wang, Hao
    Tang, Yixuan
    [J]. OCEAN ENGINEERING, 2023, 286
  • [10] Deep reinforcement learning-based controller for path following of an unmanned surface vehicle
    Woo, Joohyun
    Yu, Chanwoo
    Kim, Nakwan
    [J]. OCEAN ENGINEERING, 2019, 183 : 155 - 166