Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引:0
|
作者
Farzanegan, Behzad [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA
关键词
Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;
D O I
10.1109/CCTA60707.2024.10666630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.
引用
收藏
页码:651 / 656
页数:6
相关论文
共 50 条
  • [31] Learning from Adaptive Neural Control of SISO Strict-feedback Nonlinear Systems
    Wu Yuxiang
    Zhou Yongde
    Wang Cong
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2957 - 2962
  • [32] Iterative learning control of strict-feedback nonlinear time-varying systems
    Zhu S.
    Sun M.-X.
    He X.-X.
    Zidonghua Xuebao/ Acta Automatica Sinica, 2010, 36 (03): : 454 - 458
  • [33] Adaptive Backstepping Control and Application for Strict-Feedback Nonlinear Systems with Mismatched Uncertainties
    Xu, Zibin
    Min, Jianqing
    Ruan, Jian
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1392 - +
  • [34] Composite nonlinear feedback control for strict-feedback nonlinear systems with input saturation
    Lu, Tao
    Lan, Weiyao
    INTERNATIONAL JOURNAL OF CONTROL, 2019, 92 (09) : 2170 - 2177
  • [35] Reinforcement learning-based tracking control of autonomous underwater vehicles for seafloor platform data collection
    Weng, Yang
    Chun, Sehwa
    Sekimori, Yuki
    Yokohata, Hiroki
    Matsuda, Takumi
    Pajarinen, Joni
    Maki, Toshihiro
    OCEAN ENGINEERING, 2025, 328
  • [36] Learning from neural control of strict-feedback systems
    Liu, Tengfei
    Wang, Cong
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 3127 - 3132
  • [37] Quantized Output Feedback Control for a Class of Strict-Feedback Nonlinear Systems
    Sun, Kangkang
    Ma, Min
    Qiu, Jianbin
    2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 97 - 101
  • [38] Adaptive optimal dynamic surface control of strict-feedback nonlinear systems with output constraints
    Zhang, Tianping
    Xu, Haoxiang
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (05) : 2059 - 2078
  • [39] Output Feedback Stabilization of Nonlinear Strict-feedback Networked Control Systems
    Katayama, Hitoshi
    IFAC PAPERSONLINE, 2023, 56 (02): : 9499 - 9504
  • [40] Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning
    Long, Jia
    Yu, Dengxiu
    Wen, Guoxing
    Li, Li
    Wang, Zhen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 817 - 830