Sliding mode heading control for AUV based on continuous hybrid model-free and model-based reinforcement learning

被引:16
|
作者
Wang, Dianrui [1 ]
Shen, Yue [1 ]
Wan, Junhe [1 ]
Sha, Qixin [1 ]
Li, Guangliang [1 ]
Chen, Guanzhong [1 ]
He, Bo [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao 266000, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle (AUV); Model-based reinforcement learning; Model-free reinforcement learning; Deterministic policy gradient (DPG); Sliding mode control (SMC); NONLINEAR-SYSTEMS; ADAPTIVE-CONTROL; PID CONTROL; DESIGN;
D O I
10.1016/j.apor.2021.102960
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
For autonomous underwater vehicles (AUVs), control over AUV heading is of key importance to enable highperformance locomotion control. In this study, the heading control is achieved by using the robust sliding mode control (SMC) method. The performance of the controller can be seriously affected by its parameters. However, it is time-consuming and labor-intensive to manually adjust the parameters. Most of the existing methods rely on the accurate AUV model or prior knowledge, which are difficult to obtain. Therefore, this study is concerned with the problem of automatically tuning the SMC parameters through reinforcement learning (RL). First, an AUV dynamic model with and without current influence was successfully established. Second, a continuous hybrid Model-based Model-free (MbMf) RL method based on the deterministic policy gradient was introduced and explained. Then, the framework for tuning the parameters of SMC by the RL method was described. Finally, to demonstrate the robustness and effectiveness of our approach, extensive numerical simulations were conducted on the established AUV model. The results show that our method can automatically tune the SMC parameters. The performance is more effective than SMC with fixed parameters or SMC with a purely model-free learner.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] MODEL-BASED AND MODEL-FREE CONTROL OF AUTOCORRELATED PROCESSES
    RUNGER, GC
    WILLEMAIN, TR
    JOURNAL OF QUALITY TECHNOLOGY, 1995, 27 (04) : 283 - 292
  • [32] Model-based and model-free control of autocorrelated processes
    Univ of Maryland, College Park, MD, United States
    J Qual Technol, 4 (283-292):
  • [33] Reliance on model-based and model-free control in obesity
    Lieneke K. Janssen
    Florian P. Mahner
    Florian Schlagenhauf
    Lorenz Deserno
    Annette Horstmann
    Scientific Reports, 10
  • [34] Benchmarking model-free and model-based optimal control
    Koryakovskiy, Ivan
    Kudruss, Manuel
    Babuska, Robert
    Caarls, Wouter
    Kirches, Christian
    Mombaur, Katja
    Schloeder, Johannes P.
    Vallery, Heike
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 92 : 81 - 90
  • [35] An Hybrid Model-Free Reinforcement Learning Approach for HVAC Control
    Solinas, Francesco M.
    Bellagarda, Andrea
    Macii, Enrico
    Patti, Edoardo
    Bottaccioli, Lorenzo
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [36] Connecting Model-Based and Model-Free Control With Emotion Modulation in Learning Systems
    Huang, Xiao
    Wu, Wei
    Qiao, Hong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (08): : 4624 - 4638
  • [37] Model-free control based on reinforcement learning for a wastewater treatment problem
    Syafiie, S.
    Tadeo, F.
    Martinez, E.
    Alvarez, T.
    APPLIED SOFT COMPUTING, 2011, 11 (01) : 73 - 82
  • [38] MODEL-FREE PREDICTIVE CONTROL OF NONLINEAR PROCESSES BASED ON REINFORCEMENT LEARNING
    Shah, Hitesh
    Gopal, M.
    IFAC PAPERSONLINE, 2016, 49 (01): : 89 - 94
  • [39] Model-Free Reinforcement Learning based Lateral Control for Lane Keeping
    Zhang, Qichao
    Luo, Rui
    Zhao, Dongbin
    Luo, Chaomin
    Qian, Dianwei
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [40] Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing
    Yang, Max
    Lin, Yijiong
    Church, Alex
    Lloyd, John
    Zhang, Dandan
    Barton, David A. W.
    Lepora, Nathan F.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5480 - 5487