Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引:0
|
作者
Li, Xinmao [1 ,2 ]
Geng, Lingbo [1 ]
Liu, Kaizhou [1 ]
Zhao, Yifeng [1 ,2 ]
Du, Weifeng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2024.119432
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Prioritized experience replay based reinforcement learning for adaptive tracking control of autonomous underwater vehicle
    Li, Ting
    Yang, Dongsheng
    Xie, Xiangpeng
    APPLIED MATHEMATICS AND COMPUTATION, 2023, 443
  • [32] An obstacle avoiding method of autonomous underwater vehicle based on the reinforcement learning
    Li, Wenbiao
    Yang, Xian
    Yan, Jing
    Luo, Xiaoyuan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 4538 - 4543
  • [33] Control of the Motion Orientation of Autonomous Underwater Vehicle
    Nguyen Quang Vinh
    Pham Van Phuc
    PROCEEDINGS OF THE 13TH INTERNATIONAL SYMPOSIUM INTELLIGENT SYSTEMS 2018 (INTELS'18), 2019, 150 : 69 - 77
  • [34] Research on motion control of an autonomous underwater vehicle
    Wang J.
    Sun Y.
    Wan L.
    Jiang D.
    Chang W.
    Gaojishu Tongxin/Chinese High Technology Letters, 2010, 20 (11): : 1156 - 1161
  • [35] Integration of Robust Control with Reinforcement Learning for Safe Autonomous Vehicle Motion
    Lelko, Attila
    Nemeth, Balazs
    Fenyes, Daniel
    Gaspar, Peter
    IFAC PAPERSONLINE, 2023, 56 (02): : 1101 - 1106
  • [36] Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    APPLIED OCEAN RESEARCH, 2022, 129
  • [37] Autonomous underwater vehicle control using reinforcement learning policy search methods
    El-Fakdi, A
    Carreras, M
    Palomeras, N
    Ridao, P
    OCEANS 2005 - EUROPE, VOLS 1 AND 2, 2005, : 793 - 798
  • [38] Dynamics-Aligned Transfer Reinforcement Learning For Autonomous Underwater Vehicle Control
    Cheng, Kai
    Lu, Wenjie
    Xiong, Hao
    Liu, Honghai
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 1040 - 1045
  • [39] A physics-informed reinforcement learning-based strategy for local and coordinated ramp metering
    Han, Yu
    Wang, Meng
    Li, Linghui
    Roncoli, Claudio
    Gao, Jinda
    Liu, Pan
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 137
  • [40] Physics-Informed Transfer Learning for Process Control Applications
    Arce Munoz, Samuel
    Pershing, Jonathan
    Hedengren, John D.
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2024,