High-precision motion control of underwater gliders based on reinforcement learning

被引:1
|
作者
Juan, Rongshun [1 ]
Wang, Tianshu [1 ]
Liu, Shoufu [1 ]
Zhou, Yatao [1 ]
Ma, Wei [2 ]
Niu, Wendong [2 ]
Gao, Zhongke [1 ]
机构
[1] Tianjin Univ, Sch Elect Automat & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Sch Mech Engn, Tianjin 300072, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Underwater glider; Motion control; Reinforcement learning; Sea trail; Data driven; TRACKING CONTROL; NEURAL-NETWORK; VEHICLES; ROBOT; MODEL;
D O I
10.1016/j.oceaneng.2024.118603
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
An underwater glider (UG) is a buoyancy-propelled and fixed-wing vehicle with attitude controlled completely by means of internal mass redistribution. Since the highly nonlinear and cross-coupled system dynamics, uncertain hydrodynamic parameters and uncertain internal dynamics, model-based control is hardly possible to employ in motion control of UGs. This paper presents a model-free motion control framework of UGs which named inverse model control (IMC). It takes the desired velocity as input, and outputs the control variables of directly and enables the glider to complete the behaviors of motion. Besides, we propose a novel method to represent the framework based on model-free reinforcement learning, which named heterogeneous agent asynchronous policy gradient (HAAPG). It has achieved high-precision motion control and could intermittently adjust the movable mass block rotation amount to correct the deviation caused by currents, which is energysaving. To verify the effectiveness of the IMC framework and the superiority of HAPPG algorithm, the dynamic model of underwater glider is established and a series of motion simulation scenarios are established. In addition, we conduct sea trials and deploy our IMC framework into actual underwater gliders to perform tasks and achieved satisfactory performance.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Collaborative Learning of High-Precision Quantum Control and Tomography
    Ding, Hai-Jin
    Chu, Bing
    Wu, Re-Bing
    IFAC PAPERSONLINE, 2019, 52 (29): : 128 - 133
  • [32] Motion control of autonomous underwater vehicle based on physics-informed offline reinforcement learning
    Li, Xinmao
    Geng, Lingbo
    Liu, Kaizhou
    Zhao, Yifeng
    Du, Weifeng
    OCEAN ENGINEERING, 2024, 313
  • [33] Meta Reinforcement Learning Based Underwater Manipulator Control
    Moon, Jiyoun
    Bae, Sung-hoon
    Cashmore, Michael
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 1473 - 1476
  • [34] High-Precision Contour-Tracking Control of Ethernet-Based Networked Motion Control Systems
    Wu, Xiang
    Dong, Hui
    She, Jinhua
    Yu, Li
    IEEJ JOURNAL OF INDUSTRY APPLICATIONS, 2020, 9 (01) : 1 - 10
  • [35] A HIGH-PRECISION STRAIGHT MOTION SYSTEM
    SHIMOKOHBE, A
    AOYAMA, H
    BULLETIN OF THE JAPAN SOCIETY OF PRECISION ENGINEERING, 1986, 20 (01): : 26 - 32
  • [36] High-Precision Motion Compensation for LiDAR Based on LiDAR Odometry
    Qin, Peilin
    Zhang, Chuanwei
    Ma, Xiaowen
    Shi, Zhenghe
    Wireless Communications and Mobile Computing, 2022, 2022
  • [37] High-Precision Motion Compensation for LiDAR Based on LiDAR Odometry
    Qin, Peilin
    Zhang, Chuanwei
    Ma, Xiaowen
    Shi, Zhenghe
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [38] A Base Station Placement Method For High-precision Positioning Using Reinforcement Learning
    Hwang J.G.
    Park J.G.
    Journal of Institute of Control, Robotics and Systems, 2023, 29 (11) : 836 - 840
  • [39] Integrated system development process for high-precision motion control systems
    Zschaeck, S.
    Amthor, A.
    Mueller, M.
    Kloeckner, J.
    Ament, Ch.
    Fengler, W.
    2010 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 2010, : 344 - 350
  • [40] Disturbance-rejection high-precision motion control of a Stewart platform
    Su, YX
    Duan, BY
    Zheng, CH
    Zhang, YF
    Chen, GD
    Mi, JW
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2004, 12 (03) : 364 - 374