Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引：0

作者：

Farzanegan, Behzad ^{[1
]}

Jagannathan, S. ^{[1
]}

机构：

[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA

来源：

2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024 | 2024年

关键词：

Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;

D O I：

10.1109/CCTA60707.2024.10666630

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.

引用

页码：651 / 656

页数：6

共 50 条

[31] Learning from Adaptive Neural Control of SISO Strict-feedback Nonlinear Systems
Wu Yuxiang
Zhou Yongde
Wang Cong
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2957 - 2962
[32] Iterative learning control of strict-feedback nonlinear time-varying systems
Zhu S.
Sun M.-X.
He X.-X.
Zidonghua Xuebao/ Acta Automatica Sinica, 2010, 36 (03): : 454 - 458
[33] Adaptive Backstepping Control and Application for Strict-Feedback Nonlinear Systems with Mismatched Uncertainties
Xu, Zibin
Min, Jianqing
Ruan, Jian
PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1392 - +
[34] Composite nonlinear feedback control for strict-feedback nonlinear systems with input saturation
Lu, Tao
Lan, Weiyao
INTERNATIONAL JOURNAL OF CONTROL, 2019, 92 (09) : 2170 - 2177
[35] Reinforcement learning-based tracking control of autonomous underwater vehicles for seafloor platform data collection
Weng, Yang
Chun, Sehwa
Sekimori, Yuki
Yokohata, Hiroki
Matsuda, Takumi
Pajarinen, Joni
Maki, Toshihiro
OCEAN ENGINEERING, 2025, 328
[36] Learning from neural control of strict-feedback systems
Liu, Tengfei
Wang, Cong
2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 3127 - 3132
[37] Quantized Output Feedback Control for a Class of Strict-Feedback Nonlinear Systems
Sun, Kangkang
Ma, Min
Qiu, Jianbin
2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 97 - 101
[38] Adaptive optimal dynamic surface control of strict-feedback nonlinear systems with output constraints
Zhang, Tianping
Xu, Haoxiang
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (05) : 2059 - 2078
[39] Output Feedback Stabilization of Nonlinear Strict-feedback Networked Control Systems
Katayama, Hitoshi
IFAC PAPERSONLINE, 2023, 56 (02): : 9499 - 9504
[40] Game-Based Backstepping Design for Strict-Feedback Nonlinear Multi-Agent Systems Based on Reinforcement Learning
Long, Jia
Yu, Dengxiu
Wen, Guoxing
Li, Li
Wang, Zhen
Chen, C. L. Philip
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 817 - 830

← 1 2 3 4 5 →