Prioritized experience replay based reinforcement learning for adaptive tracking control of autonomous underwater vehicle

被引:3
|
作者
Li, Ting [1 ]
Yang, Dongsheng [2 ]
Xie, Xiangpeng [3 ]
机构
[1] Beijing Inst Graph Commun, Coll Mech & Elect Engn, Beijing 102627, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Inst Adv Technol, Nanjing 210023, Peoples R China
关键词
Adaptive dynamic programming; Tracking control; Disturbances; Data-driven; Experience replay; Autonomous underwater vehicle; TRAJECTORY-TRACKING; DESIGN; AUV;
D O I
10.1016/j.amc.2022.127734
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
A novel adaptive trajectory tracking control method is proposed in this paper for au- tonomous underwater vehicle systems with external disturbances. The control action is composed of sliding mode control and action-depended heuristic dynamic programming (ADHDP) controller to track the expected sailing position and angle in the horizontal co- ordinate system with fixed depth. As an auxiliary control of sliding mode control, the AD- HDP controller observes the difference between the actual sailing position/angle and the expected sailing position/angle, and adaptively provides corresponding supplementary con- trol actions using a data-driven fashion. At the same time, we design a weight-related pri- ority experience replay (PER) technology to update the online weight network by using the relevant historical data stored in the database to improve the learning rate. The proposed algorithm can adjust parameters online under various conditions. Furthermore, it is very suitable for underwater vehicle systems with parameter uncertainties and external distur- bances. Based on Lyapunov stability method, the stability of closed-loop system state and network weight error is analyzed. Finally, the validity of our control strategy is verified by simulation.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Batch process control based on reinforcement learning with segmented prioritized experience replay
    Xu, Chen
    Ma, Junwei
    Tao, Hongfeng
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (05)
  • [2] Deep Reinforcement Learning Based Optimal Trajectory Tracking Control of Autonomous Underwater Vehicle
    Yu, Runsheng
    Shi, Zhenyu
    Huang, Chaoxing
    Li, Tenglong
    Ma, Qiongxiong
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 4958 - 4965
  • [3] Adaptive Tracking Control of an Autonomous Underwater Vehicle
    Basant Kumar Sahu
    Bidyadhar Subudhi
    [J]. Machine Intelligence Research, 2014, 11 (03) : 299 - 307
  • [4] Adaptive tracking control of an autonomous underwater vehicle
    Centre for Industrial Electronics and Robotics, Department of Electrical Engineering, National Institute of Technology, Rourkela, India
    [J]. Int. J. Autom. Comput., 3 (299-307):
  • [5] Adaptive tracking control of an autonomous underwater vehicle
    Sahu B.K.
    Subudhi B.
    [J]. International Journal of Automation and Computing, 2014, 11 (3) : 299 - 307
  • [6] Adaptive Tracking Control of an Autonomous Underwater Vehicle
    Sahu, Basant Kumar
    Subudhi, Bidyadhar
    [J]. International Journal of Automation and Computing, 2014, 11 (03) : 299 - 307
  • [7] Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle
    Hadi, Behnaz
    Khosravi, Alireza
    Sarhadi, Pouria
    [J]. APPLIED OCEAN RESEARCH, 2022, 129
  • [8] Autonomous reinforcement learning with experience replay
    Wawrzynski, Pawel
    Tanwani, Ajay Kumar
    [J]. NEURAL NETWORKS, 2013, 41 : 156 - 167
  • [9] Dynamic Target Tracking of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
    Shi, Jiaxiang
    Fang, Jianer
    Zhang, Qizhong
    Wu, Qiuxuan
    Zhang, Botao
    Gao, Farong
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (10)
  • [10] Adaptive Region Tracking Control for Autonomous Underwater Vehicle
    Li, X.
    Hou, S. P.
    Cheah, C. C.
    [J]. 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2129 - 2134