Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引：0

作者：

Farzanegan, Behzad ^{[1
]}

Jagannathan, S. ^{[1
]}

机构：

[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA

来源：

2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024 | 2024年

关键词：

Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;

D O I：

10.1109/CCTA60707.2024.10666630

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.

引用

页码：651 / 656

页数：6

共 50 条

[41] Output-constrained finite-time tracking control for strict-feedback nonlinear systems
Wang, Chunxiao
Wu, Yuqiang
2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 826 - 831
[42] Constrained Adaptive Neural Control of Nonlinear Strict-Feedback Systems with Input Dead-Zone
Shi, Jingping
Wu, Zhonghua
Lu, Jingchao
MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
[43] Reinforcement Learning-Based Predictive Control for Autonomous Electrified Vehicles
Liu, Teng
Yang, Chao
Hu, Chuanzheng
Wang, Hong
Li, Li
Cao, Dongpu
Wang, Fei-Yue
2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 185 - 190
[44] Robust adaptive control of strict-feedback nonlinear systems with nonlinear parameterization
Wang, J
Qu, ZH
PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 3026 - 3031
[45] Stabilization Control for Strict-Feedback Nonlinear Systems With Time Delays
Li, Wenjie
Zhang, Zhengqiang
Ge, Shuzhi Sam
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (12): : 7549 - 7560
[46] Predictor-based Consensus Control of Uncertain Nonlinear Strict-feedback Systems
Wang, Wei
Yu, Yang
IEEE ICCSS 2016 - 2016 3RD INTERNATIONAL CONFERENCE ON INFORMATIVE AND CYBERNETICS FOR COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2016, : 294 - 298
[47] Robust Control of Nonlinear Strict-feedback Systems with Measurement Errors
Liu, Tengfei
Jiang, Zhong-Ping
Hill, David J.
2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 2034 - 2039
[48] An approach to neural control of a class of strict-feedback nonlinear systems
Sun, Gang
Wang, Dan
Peng, Zhou-Hua
Lan, Wei-Yao
Wang, Hao
Kongzhi yu Juece/Control and Decision, 2013, 28 (05): : 778 - 781
[49] Optimized backstepping tracking control using reinforcement learning for strict-feedback nonlinear systems with monotone tube performance boundaries
Zhang, Gengning
Wang, Xin
Wang, Ziming
Pang, Ning
INTERNATIONAL JOURNAL OF CONTROL, 2025,
[50] Subsystem-Based Control With Modularity for Strict-Feedback Form Nonlinear Systems
Koivumaki, Janne
Humaloja, Jukka-Pekka
Paunonen, Lassi
Zhu, Wen-Hong
Mattila, Jouni
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (07) : 4336 - 4343

← 1 2 3 4 5 →