Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引:0
|
作者
Li, Xinmao [1 ,2 ]
Geng, Lingbo [1 ]
Liu, Kaizhou [1 ]
Zhao, Yifeng [1 ,2 ]
Du, Weifeng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2024.119432
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Safe Reinforcement Learning for Energy Management of Electrified Vehicle With Novel Physics-Informed Exploration Strategy
    Biswas, Atriya
    Acquarone, Matteo
    Wang, Hao
    Miretti, Federico
    Misul, Daniela Anna
    Emadi, Ali
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (04): : 9814 - 9828
  • [22] Efficient deep reinforcement learning strategies for active flow control based on physics-informed neural networks
    Hu, Wulong
    Jiang, Zhangze
    Xu, Mingyang
    Hu, Hanyu
    PHYSICS OF FLUIDS, 2024, 36 (07)
  • [23] Physics-Informed Particle-Based Reinforcement Learning for Autonomy in Signalized Intersections
    Emamifar, Mehrnoosh
    Ghoreishi, Seyede Fatemeh
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2024, 22 (02) : 416 - 430
  • [24] Physics-informed Dyna-style model-based deep reinforcement learning for dynamic control
    Liu, Xin-Yang
    Wang, Jian-Xun
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2021, 477 (2255):
  • [25] Physics-informed reinforcement learning optimization of nuclear assembly design
    Radaideh, Majdi, I
    Wolverton, Isaac
    Joseph, Joshua
    Tusar, James J.
    Otgonbaatar, Uuganbayar
    Roy, Nicholas
    Forget, Benoit
    Shirvan, Koroush
    NUCLEAR ENGINEERING AND DESIGN, 2021, 372
  • [26] Safe Navigation of Autonomous Underwater Vehicles Using Physics-Informed Neural Networks
    Majumder, Rudrashis
    Makam, Rajini
    Mane, Pruthviraj
    BharathwajK.S
    Sundaram, Suresh
    Oceans Conference Record (IEEE), 2024,
  • [27] Reinforcement learning based parameter optimization of active disturbance rejection control for autonomous underwater vehicle
    SONG Wanping
    CHEN Zengqiang
    SUN Mingwei
    SUN Qinglin
    JournalofSystemsEngineeringandElectronics, 2022, 33 (01) : 170 - 179
  • [28] Safe Navigation of Autonomous Underwater Vehicles Using Physics-informed Neural Networks
    Majumder, Rudrashis
    Makam, Rajini
    Mane, Pruthviraj
    Bharathwaj, K. S.
    Sundaram, Suresh
    OCEANS 2024 - SINGAPORE, 2024,
  • [29] Path-following optimal control of autonomous underwater vehicle based on deep reinforcement learning
    Wang, Zhanyuan
    Li, Yulong
    Ma, Caipeng
    Yan, Xun
    Jiang, Dapeng
    OCEAN ENGINEERING, 2023, 268
  • [30] Reinforcement learning based parameter optimization of active disturbance rejection control for autonomous underwater vehicle
    Song Wanping
    Chen Zengqiang
    Sun Mingwei
    Sun Qinglin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (01) : 170 - 179