Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引:0
|
作者
Li, Xinmao [1 ,2 ]
Geng, Lingbo [1 ]
Liu, Kaizhou [1 ]
Zhao, Yifeng [1 ,2 ]
Du, Weifeng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2024.119432
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Phyllis: Physics-Informed Lifelong Reinforcement Learning for Data Center Cooling Control
    Wang, Ruihang
    Cao, Zhiwei
    Zhou, Xin
    Wen, Yonggang
    Tan, Rui
    PROCEEDINGS OF THE 2023 THE 14TH ACM INTERNATIONAL CONFERENCE ON FUTURE ENERGY SYSTEMS, E-ENERGY 2023, 2023, : 114 - 126
  • [12] Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle
    Prashant Bhopale
    Faruk Kazi
    Navdeep Singh
    Journal of Marine Science and Application, 2019, 18 : 228 - 238
  • [13] A general motion controller based on deep reinforcement learning for an autonomous underwater vehicle with unknown disturbances
    Huang, Fei
    Xu, Jian
    Wu, Di
    Cui, Yunfei
    Yan, Zheping
    Xing, Wen
    Zhang, Xun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [14] Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle
    Bhopale, Prashant
    Kazi, Faruk
    Singh, Navdeep
    JOURNAL OF MARINE SCIENCE AND APPLICATION, 2019, 18 (02) : 228 - 238
  • [15] A control strategy of normal motion and active self-rescue for autonomous underwater vehicle based on deep reinforcement learning
    Fang, Yuan
    Pu, Jinyun
    Yuan, Chengren
    Cao, Yuxuan
    Liu, Shuyong
    AIP ADVANCES, 2022, 12 (01)
  • [16] Deep Reinforcement Learning Based Optimal Trajectory Tracking Control of Autonomous Underwater Vehicle
    Yu, Runsheng
    Shi, Zhenyu
    Huang, Chaoxing
    Li, Tenglong
    Ma, Qiongxiong
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 4958 - 4965
  • [17] Evaluation of a Deep-Reinforcement-Learning-based Controller for the Control of an Autonomous Underwater Vehicle
    Sola, Yoann
    Chaffre, Thomas
    le Chenadec, Gilles
    Sammut, Karl
    Clement, Benoit
    GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [18] Research on model predictive control of autonomous underwater vehicle based on physics informed neural network modeling
    Liu, Tao
    Zhao, Jintao
    Huang, Junhao
    Li, Zhenglin
    Xu, Lingji
    Zhao, Bo
    OCEAN ENGINEERING, 2024, 304
  • [19] A general motion control framework for an autonomous underwater vehicle through deep reinforcement learning and disturbance observers*
    Xu, Jian
    Huang, Fei
    Wu, Di
    Cui, Yunfei
    Yan, Zheping
    Chen, Tao
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (08): : 5728 - 5758
  • [20] Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control
    Liu, Tao
    Hu, Yuli
    Xu, Hui
    COMPLEXITY, 2021, 2021