Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引:0
|
作者
Li, Xinmao [1 ,2 ]
Geng, Lingbo [1 ]
Liu, Kaizhou [1 ]
Zhao, Yifeng [1 ,2 ]
Du, Weifeng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2024.119432
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Physics-Informed Representation and Learning: Control and Risk Quantification
    Wang, Zhuoyuan
    Keller, Reece
    Deng, Xiyu
    Hoshino, Kenta
    Tanaka, Takashi
    Nakahira, Yorie
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21699 - 21707
  • [42] On-the-fly autonomous control of neutron diffraction via physics-informed Bayesian active learning
    McDannald, Austin
    Frontzek, Matthias
    Savici, Andrei T.
    Doucet, Mathieu
    Rodriguez, Efrain E.
    Meuse, Kate
    Opsahl-Ong, Jessica
    Samarov, Daniel
    Takeuchi, Ichiro
    Ratcliff, William
    Kusne, A. Gilad
    APPLIED PHYSICS REVIEWS, 2022, 9 (02)
  • [43] Autonomous Building Control Using Offline Reinforcement Learning
    Schepers, Jorren
    Eyckerman, Reinout
    Elmaz, Furkan
    Casteels, Wim
    Latre, Steven
    Hellinckx, Peter
    ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 3PGCIC-2021, 2022, 343 : 246 - 255
  • [44] Physics-informed machine learning
    George Em Karniadakis
    Ioannis G. Kevrekidis
    Lu Lu
    Paris Perdikaris
    Sifan Wang
    Liu Yang
    Nature Reviews Physics, 2021, 3 : 422 - 440
  • [45] Physics-informed machine learning
    Karniadakis, George Em
    Kevrekidis, Ioannis G.
    Lu, Lu
    Perdikaris, Paris
    Wang, Sifan
    Yang, Liu
    NATURE REVIEWS PHYSICS, 2021, 3 (06) : 422 - 440
  • [46] Physics-Informed Graph Learning
    Peng, Ciyuan
    Xia, Feng
    Saikrishna, Vidya
    Liu, Huan
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 732 - 739
  • [47] Physics-informed models of domain wall dynamics as a route for autonomous domain wall design via reinforcement learning
    Smith, Benjamin R.
    Pant, Bharat
    Liu, Yongtao
    Liu, Yu-Chen
    Yang, Jan-Chi
    Jesse, Stephen
    Khojandi, Anahita
    Kalinin, Sergei V.
    Cao, Ye
    Vasudevan, Rama K.
    DIGITAL DISCOVERY, 2024, 3 (03): : 456 - 466
  • [48] Physics-informed Reinforcement Learning optimization of PWR core loading pattern
    Seurin, Paul
    Shirvan, Koroush
    ANNALS OF NUCLEAR ENERGY, 2024, 208
  • [49] Accurate melting point prediction through autonomous physics-informed learning
    Klimanova O.
    Miryashkin T.
    Shapeev A.
    Physical Review B, 2023, 108 (18)
  • [50] Three-Dimensional Path Tracking Control of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
    Sun, Yushan
    Zhang, Chenming
    Zhang, Guocheng
    Xu, Hao
    Ran, Xiangrui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2019, 7 (12)