Fuzzy Control Based on Reinforcement Learning and Subsystem Error Derivatives for Strict-Feedback Systems With an Observer

被引:37
|
作者
Li, Dongdong [1 ,2 ,3 ]
Dong, Jiuxiang [1 ,2 ,3 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Key Lab Vibrat & Control Aeroprop Syst, Minist Educ China, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); fuzzy adaptive control; fuzzy logic systems (FLSs); fuzzy state observer; optimized backstepping control (OBC); reinforcement learning (RL); TRACKING CONTROL;
D O I
10.1109/TFUZZ.2022.3227993
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, a novel optimized fuzzy adaptive control method based on tracking error derivatives of subsystems is proposed for strict-feedback systems with unmeasurable states. A cost function based on the tracking error derivative is used. It not only solves the problem that the traditional input quadratic cost function at the infinite time is unbounded, but also solves the problem that the optimal control input derived from the cost function with exponential discount factor cannot make the error asymptotically stable. Considering the case where the states are unmeasurable, a fuzzy state observer is designed that removes the restriction of the Hurwitz equation for the gain parameters. Based on reinforcement learning, the observer, and error derivative cost function, an improved optimized backstepping control method is given. Using observed information and actor-critic structure to train fuzzy logic systems online, the control inputs are obtained to achieve approximate optimal control. Finally, all closed-loop signals are proved to be bounded by the Lyapunov method, and the effectiveness and advantages of the proposed algorithm are verified through two examples.
引用
收藏
页码:2509 / 2521
页数:13
相关论文
共 50 条
  • [1] Finite-time adaptive optimal control of uncertain strict-feedback nonlinear systems based on fuzzy observer and reinforcement learning
    Sun, Yue
    Chen, Ming
    Peng, Kaixiang
    Wu, Libing
    Liu, Cungen
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (08) : 1553 - 1570
  • [2] Observer-based fuzzy adaptive control for strict-feedback nonlinear systems
    Tong, Shaocheng
    Li, Yongming
    FUZZY SETS AND SYSTEMS, 2009, 160 (12) : 1749 - 1764
  • [4] Neural Learning Control of Strict-Feedback Systems Using Disturbance Observer
    Xu, Bin
    Shou, Yingxin
    Luo, Jun
    Pu, Huayan
    Shi, Zhongke
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1296 - 1307
  • [5] Iterative learning control of error-constrained strict-feedback systems
    Chen J.-Y.
    Sun M.-X.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2020, 37 (06): : 1358 - 1366
  • [6] Subsystem-Based Control With Modularity for Strict-Feedback Form Nonlinear Systems
    Koivumaki, Janne
    Humaloja, Jukka-Pekka
    Paunonen, Lassi
    Zhu, Wen-Hong
    Mattila, Jouni
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (07) : 4336 - 4343
  • [7] Optimized Backstepping Control Using Reinforcement Learning of Observer-Critic-Actor Architecture Based on Fuzzy System for a Class of Nonlinear Strict-Feedback Systems
    Wen, Guoxing
    Li, Bin
    Niu, Ben
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (10) : 4322 - 4335
  • [8] Composite Learning Fuzzy Control of Stochastic Nonlinear Strict-Feedback Systems
    Wang, Xia
    Xu, Bin
    Li, Shuai
    Yang, Qinmin
    Fan, Quanyong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (04) : 705 - 715
  • [9] Learning-based Fuzzy Control for Strict-feedback Nonlinear Systems with Unknown Uncertainties
    Ma, Min
    Liu, Xiaokun
    Huang, He
    Wang, Tong
    Qiu, Jianbin
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2089 - 2093
  • [10] Observer-based adaptive fuzzy funnel control for strict-feedback nonlinear systems with unknown control coefficients
    Liu, Cungen
    Liu, Xiaoping
    Wang, Huanqing
    Zhou, Yucheng
    Lu, Shouyin
    NEUROCOMPUTING, 2019, 358 : 467 - 478