Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning

被引：10

作者：

Roshanravan, Sajad ^{[1
]}

Shamaghdari, Saeed ^{[1
]}

机构：

[1] Iran Univ Sci & Technol IUST, Elect Engn Dept, Tehran 1311416846, Iran

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 01期

关键词：

Fault detection; fault-tolerant tracking control; reinforcement learning; affine nonlinear systems; process and actuator faults;

D O I：

10.1109/TASE.2022.3223702

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates the optimal fault-tolerant tracking control (FTTC) problem for unknown affine nonlinear continuous-time systems with process and actuator faults in the framework of reinforcement learning (RL). The proposed novel active FTTC scheme is based on adaptive optimal control theory. In this way, the FTTC problem is formulated as an optimal regulation problem for the augmented system, which consists of the controlled system and the reference trajectory. To solve the Hamilton-Jacobi-Bellman (HJB) equation of the augmented system, an identifier-critic-based online RL strategy is employed with a dual neural network (NN) approximation structure. Initially, in order to remove the requirement of prior knowledge of the system dynamics, an adaptive NN identifier is designed. The forgetting factor in the proposed identifier update law is variable and a function of the filtered state estimation error and filtered state error. Choosing this variable forgetting factor increases the convergence speed and decreases the estimation error of identifier NN weights compared to the constant one while maintaining its robustness. When a fault occurs, the system continues to operate under the former FTTC until the fault is detected. Meanwhile, the optimal FTTC design in the RL framework requires the initial admissible control condition. In order to make it possible to initiate the FTTC learning process from the former FTTC, we employed a stabilizing term in the critical update rule. The Uniformly Ultimately Boundedness (UUB) of identifier and critic NN weight errors and, as a result, the convergence of the control input to the neighborhood of the optimal solution are all proved by Lyapunov theory. In the proposed method, changes in the values of faults are detected by comparing the HJB error to a predefined threshold. Finally, the simulation results are given to validate the effectiveness of the developed method. Note to Practitioners-long-time operations and the influence of external perturbations often make the faults inevitable for many practical engineering systems which can lead to unpredictable behaviors and catastrophic impacts. In general, the faults are naturally uncertain in time, value, and pattern, that is, it is unknown when, how much, and which system components fail. Therefore, the control system must be able to tolerate an extensive set of component faults. The design of optimal model-free FTTC strategies in an adaptive manner is challenging in nonlinear systems. The proposed method is suitable for a large class of nonlinear systems with input-affine form, and guarantees the system stability in the presence of process and actuator faults.

引用

页码：569 / 580

页数：12

共 50 条

[21] Adaptive fault-tolerant control of MIMO nonlinear systems
Zhang, Lu
Song, Yongduan
He, Liu
2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 590 - 595
[22] Composite DOBC with fuzzy fault-tolerant control for stochastic systems with unknown nonlinear dynamics
Sun, Shixiang
Ren, Tao
Wei, Xinjiang
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (18) : 6605 - 6615
[23] Continuous reinforcement learning to robust fault tolerant control for a class of unknown nonlinear systems
Farivar, Faezeh
Ahmadabadi, Majid Nili
APPLIED SOFT COMPUTING, 2015, 37 : 702 - 714
[24] Robust Fault-tolerant Tracking Control for Linear Discrete-time Systems via Reinforcement Learning Method
Ngoc Hoai An Nguyen
Sung Hyun Kim
International Journal of Control, Automation and Systems, 2025, 23 (2) : 520 - 529
[25] Adaptive Fault-Tolerant Tracking Control for Markov Jump Systems with Partly Unknown Transition Probability
Fan, Quanyong
Ye, Dan
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2270 - 2275
[26] Adaptive Fault-Tolerant Tracking Control of A Class of Uncertain Nonlinear Systems With Actuator Faults
Yang Yang
Yue Dong
Xie Xiangpeng
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 556 - 561
[27] Tracking Differentiator-Based Adaptive Fault-Tolerant Control for Stochastic Nonlinear Systems
Liu, Yanli
Ma, Hongjun
IEEE ACCESS, 2020, 8 : 72112 - 72120
[28] Adaptive Fuzzy Fault-Tolerant Control for Nonlinear Multi-Agent Systems with Unknown Control Direction
Wang, Dongyang
Li, Yongming
Tong, Shaocheng
PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 706 - 711
[29] Fuzzy Adaptive Fault-Tolerant Control of Unknown Nonlinear Systems With Time-Varying Structure
Zhang, Jin-Xi
Yang, Guang-Hong
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1904 - 1916
[30] Adaptive fault-tolerant attitude tracking control for spacecraft formation with unknown inertia
Zhu, Zhihao
Guo, Yu
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2018, 32 (01) : 13 - 26

← 1 2 3 4 5 →