A Reinforcement Learning-Based Approach for Optimal Output Tracking in Uncertain Nonlinear Systems with Mismatched Disturbances

被引：0

作者：

Tang, Zezhi ^{[1
]}

Rossiter, J. Anthony ^{[1
]}

Panoutsos, George ^{[1
]}

机构：

[1] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England

来源：

2024 UKACC 14TH INTERNATIONAL CONFERENCE ON CONTROL, CONTROL | 2024年

基金：

英国工程与自然科学研究理事会;

关键词：

reinforcement learning; disturbance observer-based control; output tracking;

D O I：

10.1109/CONTROL60310.2024.10532060

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, the optimal control problem of uncertain nonlinear systems is considered. A nonlinear disturbance observer (NDO) is proposed to measure the lumped uncertainties present in the system. Disturbances that do not enter the same channel as the control signal, so-called mismatched disturbances, are difficult to reject directly within the control channel. To overcome the challenge, a generalized disturbance observer-based compensator is implemented to address the uncertainty compensation problem by attenuating its influence on the output channel. In real time, by augmenting the system states with the output tracking error, we develop a composite actor-critic reinforcement learning (RL) scheme for approximating the optimal control policy as well as the ideal value function pertaining to the compensated system by solving the Hamilton-Jacobi-Bellman (HJB) equation. Concurrent learning is applied in this article by using the recorded data of the known model of the system, in order to enhance the robustness of the system by canceling the influence of the probing signal. Simulation results demonstrate the effectiveness of the proposed scheme, offering an optimal solution for the output tracking problem in a second-order model with mismatched disturbances.

引用

页码：169 / 174

页数：6

共 50 条

[41] OUTPUT TRACKING CONTROL OF NONLINEAR-SYSTEMS WITH MISMATCHED UNCERTAINTIES
LIAO, TL
FU, LC
HSU, CF
[J]. SYSTEMS & CONTROL LETTERS, 1992, 18 (01) : 39 - 47
[42] Optimal Adaptive Output Tracking Control for a Class of Uncertain Nonlinear Systems With Actuator Failures
Zhang S.-J.
Wu X.
Liu C.-S.
[J]. Zidonghua Xuebao/Acta Automatica Sinica, 2018, 44 (12): : 2188 - 2197
[43] Adaptive optimal output tracking of continuous-time systems via output-feedback-based reinforcement learning
Chen, Ci
Xie, Lihua
Xie, Kan
Lewis, Frank L.
Xie, Shengli
[J]. AUTOMATICA, 2022, 146
[44] Robust output tracking control of uncertain nonlinear systems
[J]. Diangong Jishu Xuebao, 4 (39-42, 34):
[45] Robust adaptive output tracking for uncertain nonlinear systems
Lee, JK
Abe, K
[J]. PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 361 - 366
[46] Reinforcement learning-based adaptive optimal tracking algorithm for Markov jump systems with partial unknown dynamics
Tu, Yidong
Fang, Haiyang
Wang, Hai
Shi, Kaibo
He, Shuping
[J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2022, 43 (05): : 1435 - 1449
[47] Output tracking for nonlinear systems with mismatched uncertainties based on approximate variable structure control
Hu, J.B.
Chu, J.
Su, H.Y.
[J]. Kongzhi yu Juece/Control and Decision, 2001, 16 (01): : 25 - 28
[48] Approximate decoupling and output tracking for MIMO nonlinear systems with mismatched uncertainties via ADRC approach
Wu, Ze-Hao
Guo, Bao-Zhu
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (09): : 3873 - 3894
[49] Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
Qin, Chunbin
Jiang, Kaijun
Zhang, Jishi
Zhu, Tianzeng
[J]. ENTROPY, 2023, 25 (07)
[50] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
Mohammadi, Mehdi
Arefi, Mohammad Mehdi
Setoodeh, Peyman
Kaynak, Okyay
[J]. INFORMATION SCIENCES, 2021, 554 : 84 - 98

← 1 2 3 4 5 →