Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning

被引:0
|
作者
Li, Xinmao [1 ,2 ]
Geng, Lingbo [1 ]
Liu, Kaizhou [1 ]
Zhao, Yifeng [1 ,2 ]
Du, Weifeng [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; Offline reinforcement learning; Physics-informed reinforcement learning; Physics informed neural network; Motion control; TRAJECTORY TRACKING;
D O I
10.1016/j.oceaneng.2024.119432
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Online reinforcement learning (RL) methods for autonomous underwater vehicles (AUV) are time-consuming and unsafe due to the need for real-world interaction. Offline RL methods can improve efficiency and safety by training with dynamic models, but an accurate model for AUV is difficult to obtain due to its highly nonlinear dynamics. These limit the application of RL methods in AUV control. To solve this issue, we propose physicsinformed model-based conservative offline policy optimization (PICOPO). It offers the advantages of small dataset, strong generalizability and high safety by combining the physics-informed dynamic modelling method and the offline RL technique. First, the PICOPO constructs a physics-informed model based on a small offline dataset to serve as the digital twins (DT) of the actual AUV. This DT can forecast the long-term motion states of AUV with high-precision. The RL-based controller is then trained offline within this DT, eliminating the need for real-world interaction and allowing direct deployment to the AUV without fine-tuning. In this paper, simulations and field tests are carried out to evaluate the proposed method. Our results demonstrate that PICOPO achieves accurate motion control with just 2000 samples and enables zero-shot sim-to-real transfer, showcasing strong generalizability across various motion control tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Transient Voltage Control Based on Physics-Informed Reinforcement Learning
    Gao, Jiemai
    Chen, Siyuan
    Li, Xiang
    Zhang, Jun
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 905 - 910
  • [2] Research on Modeling Method of Autonomous Underwater Vehicle Based on a Physics-Informed Neural Network
    Zhao, Yifeng
    Hu, Zhiqiang
    Du, Weifeng
    Geng, Lingbo
    Yang, Yi
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (05)
  • [3] Physics-informed reinforcement learning for motion control of a fish-like swimming robot
    Colin Rodwell
    Phanindra Tallapragada
    Scientific Reports, 13
  • [4] Research on obstacle avoidance of underactuated autonomous underwater vehicle based on offline reinforcement learning
    Liu, Tao
    Huang, Junhao
    Zhao, Jintao
    ROBOTICA, 2024,
  • [5] Physics-informed reinforcement learning for motion control of a fish-like swimming robot
    Rodwell, Colin
    Tallapragada, Phanindra
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [6] Physics-Informed Model-Based Reinforcement Learning
    Ramesh, Adithya
    Ravindran, Balaraman
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [7] Physics-informed reinforcement learning for optimal control of nonlinear systems
    Wang, Yujia
    Wu, Zhe
    AICHE JOURNAL, 2024, 70 (10)
  • [8] Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
    Anderlini, Enrico
    Parker, Gordon G.
    Thomas, Giles
    APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [9] A physics-informed deep reinforcement learning framework for autonomous steel frame structure design
    Fu, Bochao
    Gao, Yuqing
    Wang, Wei
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (20) : 3125 - 3144
  • [10] Inverter PQ Control With Trajectory Tracking Capability for Microgrids Based on Physics-Informed Reinforcement Learning
    She, Buxin
    Li, Fangxing
    Cui, Hantao
    Shuai, Hang
    Oboreh-Snapps, Oroghene
    Bo, Rui
    Praisuwanna, Nattapat
    Wang, Jingxin
    Tolbert, Leon M.
    IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (01) : 99 - 112