A Reinforcement Learning-Based Approach for Optimal Output Tracking in Uncertain Nonlinear Systems with Mismatched Disturbances

被引:0
|
作者
Tang, Zezhi [1 ]
Rossiter, J. Anthony [1 ]
Panoutsos, George [1 ]
机构
[1] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England
基金
英国工程与自然科学研究理事会;
关键词
reinforcement learning; disturbance observer-based control; output tracking;
D O I
10.1109/CONTROL60310.2024.10532060
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the optimal control problem of uncertain nonlinear systems is considered. A nonlinear disturbance observer (NDO) is proposed to measure the lumped uncertainties present in the system. Disturbances that do not enter the same channel as the control signal, so-called mismatched disturbances, are difficult to reject directly within the control channel. To overcome the challenge, a generalized disturbance observer-based compensator is implemented to address the uncertainty compensation problem by attenuating its influence on the output channel. In real time, by augmenting the system states with the output tracking error, we develop a composite actor-critic reinforcement learning (RL) scheme for approximating the optimal control policy as well as the ideal value function pertaining to the compensated system by solving the Hamilton-Jacobi-Bellman (HJB) equation. Concurrent learning is applied in this article by using the recorded data of the known model of the system, in order to enhance the robustness of the system by canceling the influence of the probing signal. Simulation results demonstrate the effectiveness of the proposed scheme, offering an optimal solution for the output tracking problem in a second-order model with mismatched disturbances.
引用
收藏
页码:169 / 174
页数:6
相关论文
共 50 条
  • [41] OUTPUT TRACKING CONTROL OF NONLINEAR-SYSTEMS WITH MISMATCHED UNCERTAINTIES
    LIAO, TL
    FU, LC
    HSU, CF
    [J]. SYSTEMS & CONTROL LETTERS, 1992, 18 (01) : 39 - 47
  • [42] Optimal Adaptive Output Tracking Control for a Class of Uncertain Nonlinear Systems With Actuator Failures
    Zhang S.-J.
    Wu X.
    Liu C.-S.
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2018, 44 (12): : 2188 - 2197
  • [43] Adaptive optimal output tracking of continuous-time systems via output-feedback-based reinforcement learning
    Chen, Ci
    Xie, Lihua
    Xie, Kan
    Lewis, Frank L.
    Xie, Shengli
    [J]. AUTOMATICA, 2022, 146
  • [44] Robust output tracking control of uncertain nonlinear systems
    [J]. Diangong Jishu Xuebao, 4 (39-42, 34):
  • [45] Robust adaptive output tracking for uncertain nonlinear systems
    Lee, JK
    Abe, K
    [J]. PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 361 - 366
  • [46] Reinforcement learning-based adaptive optimal tracking algorithm for Markov jump systems with partial unknown dynamics
    Tu, Yidong
    Fang, Haiyang
    Wang, Hai
    Shi, Kaibo
    He, Shuping
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (05): : 1435 - 1449
  • [47] Output tracking for nonlinear systems with mismatched uncertainties based on approximate variable structure control
    Hu, J.B.
    Chu, J.
    Su, H.Y.
    [J]. Kongzhi yu Juece/Control and Decision, 2001, 16 (01): : 25 - 28
  • [48] Approximate decoupling and output tracking for MIMO nonlinear systems with mismatched uncertainties via ADRC approach
    Wu, Ze-Hao
    Guo, Bao-Zhu
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (09): : 3873 - 3894
  • [49] Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
    Qin, Chunbin
    Jiang, Kaijun
    Zhang, Jishi
    Zhu, Tianzeng
    [J]. ENTROPY, 2023, 25 (07)
  • [50] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
    Mohammadi, Mehdi
    Arefi, Mohammad Mehdi
    Setoodeh, Peyman
    Kaynak, Okyay
    [J]. INFORMATION SCIENCES, 2021, 554 : 84 - 98