Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning

被引:10
|
作者
Roshanravan, Sajad [1 ]
Shamaghdari, Saeed [1 ]
机构
[1] Iran Univ Sci & Technol IUST, Elect Engn Dept, Tehran 1311416846, Iran
关键词
Fault detection; fault-tolerant tracking control; reinforcement learning; affine nonlinear systems; process and actuator faults;
D O I
10.1109/TASE.2022.3223702
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the optimal fault-tolerant tracking control (FTTC) problem for unknown affine nonlinear continuous-time systems with process and actuator faults in the framework of reinforcement learning (RL). The proposed novel active FTTC scheme is based on adaptive optimal control theory. In this way, the FTTC problem is formulated as an optimal regulation problem for the augmented system, which consists of the controlled system and the reference trajectory. To solve the Hamilton-Jacobi-Bellman (HJB) equation of the augmented system, an identifier-critic-based online RL strategy is employed with a dual neural network (NN) approximation structure. Initially, in order to remove the requirement of prior knowledge of the system dynamics, an adaptive NN identifier is designed. The forgetting factor in the proposed identifier update law is variable and a function of the filtered state estimation error and filtered state error. Choosing this variable forgetting factor increases the convergence speed and decreases the estimation error of identifier NN weights compared to the constant one while maintaining its robustness. When a fault occurs, the system continues to operate under the former FTTC until the fault is detected. Meanwhile, the optimal FTTC design in the RL framework requires the initial admissible control condition. In order to make it possible to initiate the FTTC learning process from the former FTTC, we employed a stabilizing term in the critical update rule. The Uniformly Ultimately Boundedness (UUB) of identifier and critic NN weight errors and, as a result, the convergence of the control input to the neighborhood of the optimal solution are all proved by Lyapunov theory. In the proposed method, changes in the values of faults are detected by comparing the HJB error to a predefined threshold. Finally, the simulation results are given to validate the effectiveness of the developed method. Note to Practitioners-long-time operations and the influence of external perturbations often make the faults inevitable for many practical engineering systems which can lead to unpredictable behaviors and catastrophic impacts. In general, the faults are naturally uncertain in time, value, and pattern, that is, it is unknown when, how much, and which system components fail. Therefore, the control system must be able to tolerate an extensive set of component faults. The design of optimal model-free FTTC strategies in an adaptive manner is challenging in nonlinear systems. The proposed method is suitable for a large class of nonlinear systems with input-affine form, and guarantees the system stability in the presence of process and actuator faults.
引用
收藏
页码:569 / 580
页数:12
相关论文
共 50 条
  • [21] Adaptive fault-tolerant control of MIMO nonlinear systems
    Zhang, Lu
    Song, Yongduan
    He, Liu
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 590 - 595
  • [22] Composite DOBC with fuzzy fault-tolerant control for stochastic systems with unknown nonlinear dynamics
    Sun, Shixiang
    Ren, Tao
    Wei, Xinjiang
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (18) : 6605 - 6615
  • [23] Continuous reinforcement learning to robust fault tolerant control for a class of unknown nonlinear systems
    Farivar, Faezeh
    Ahmadabadi, Majid Nili
    APPLIED SOFT COMPUTING, 2015, 37 : 702 - 714
  • [24] Robust Fault-tolerant Tracking Control for Linear Discrete-time Systems via Reinforcement Learning Method
    Ngoc Hoai An Nguyen
    Sung Hyun Kim
    International Journal of Control, Automation and Systems, 2025, 23 (2) : 520 - 529
  • [25] Adaptive Fault-Tolerant Tracking Control for Markov Jump Systems with Partly Unknown Transition Probability
    Fan, Quanyong
    Ye, Dan
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2270 - 2275
  • [26] Adaptive Fault-Tolerant Tracking Control of A Class of Uncertain Nonlinear Systems With Actuator Faults
    Yang Yang
    Yue Dong
    Xie Xiangpeng
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 556 - 561
  • [27] Tracking Differentiator-Based Adaptive Fault-Tolerant Control for Stochastic Nonlinear Systems
    Liu, Yanli
    Ma, Hongjun
    IEEE ACCESS, 2020, 8 : 72112 - 72120
  • [28] Adaptive Fuzzy Fault-Tolerant Control for Nonlinear Multi-Agent Systems with Unknown Control Direction
    Wang, Dongyang
    Li, Yongming
    Tong, Shaocheng
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 706 - 711
  • [29] Fuzzy Adaptive Fault-Tolerant Control of Unknown Nonlinear Systems With Time-Varying Structure
    Zhang, Jin-Xi
    Yang, Guang-Hong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1904 - 1916
  • [30] Adaptive fault-tolerant attitude tracking control for spacecraft formation with unknown inertia
    Zhu, Zhihao
    Guo, Yu
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2018, 32 (01) : 13 - 26