Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引:0
|
作者
Farzanegan, Behzad [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA
关键词
Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;
D O I
10.1109/CCTA60707.2024.10666630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.
引用
收藏
页码:651 / 656
页数:6
相关论文
共 50 条
  • [41] Output-constrained finite-time tracking control for strict-feedback nonlinear systems
    Wang, Chunxiao
    Wu, Yuqiang
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 826 - 831
  • [42] Constrained Adaptive Neural Control of Nonlinear Strict-Feedback Systems with Input Dead-Zone
    Shi, Jingping
    Wu, Zhonghua
    Lu, Jingchao
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [43] Reinforcement Learning-Based Predictive Control for Autonomous Electrified Vehicles
    Liu, Teng
    Yang, Chao
    Hu, Chuanzheng
    Wang, Hong
    Li, Li
    Cao, Dongpu
    Wang, Fei-Yue
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 185 - 190
  • [44] Robust adaptive control of strict-feedback nonlinear systems with nonlinear parameterization
    Wang, J
    Qu, ZH
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 3026 - 3031
  • [45] Stabilization Control for Strict-Feedback Nonlinear Systems With Time Delays
    Li, Wenjie
    Zhang, Zhengqiang
    Ge, Shuzhi Sam
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (12): : 7549 - 7560
  • [46] Predictor-based Consensus Control of Uncertain Nonlinear Strict-feedback Systems
    Wang, Wei
    Yu, Yang
    IEEE ICCSS 2016 - 2016 3RD INTERNATIONAL CONFERENCE ON INFORMATIVE AND CYBERNETICS FOR COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2016, : 294 - 298
  • [47] Robust Control of Nonlinear Strict-feedback Systems with Measurement Errors
    Liu, Tengfei
    Jiang, Zhong-Ping
    Hill, David J.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 2034 - 2039
  • [48] An approach to neural control of a class of strict-feedback nonlinear systems
    Sun, Gang
    Wang, Dan
    Peng, Zhou-Hua
    Lan, Wei-Yao
    Wang, Hao
    Kongzhi yu Juece/Control and Decision, 2013, 28 (05): : 778 - 781
  • [49] Optimized backstepping tracking control using reinforcement learning for strict-feedback nonlinear systems with monotone tube performance boundaries
    Zhang, Gengning
    Wang, Xin
    Wang, Ziming
    Pang, Ning
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [50] Subsystem-Based Control With Modularity for Strict-Feedback Form Nonlinear Systems
    Koivumaki, Janne
    Humaloja, Jukka-Pekka
    Paunonen, Lassi
    Zhu, Wen-Hong
    Mattila, Jouni
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (07) : 4336 - 4343