Experience replay-based output feedback Q-learning scheme for optimal output tracking control of discrete-time linear systems

被引：6

作者：

Rizvi, Syed Ali Asad ^{[1
]}

Lin, Zongli ^{[1
]}

机构：

[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA

来源：

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2019年 / 33卷 / 12期

关键词：

discounting factor; optimal tracking; output feedback; Q-learning; ADAPTIVE OPTIMAL-CONTROL; TRAJECTORY TRACKING; DESIGN; LEADER; MRAC;

D O I：

10.1002/acs.2981

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper focuses on solving the adaptive optimal tracking control problem for discrete-time linear systems with unknown system dynamics using output feedback. A Q-learning-based optimal adaptive control scheme is presented to learn the feedback and feedforward control parameters of the optimal tracking control law. The optimal feedback parameters are learned using the proposed output feedback Q-learning Bellman equation, whereas the estimation of the optimal feedforward control parameters is achieved using an adaptive algorithm that guarantees convergence to zero of the tracking error. The proposed method has the advantage that it is not affected by the exploration noise bias problem and does not require a discounting factor, relieving the two bottlenecks in the past works in achieving stability guarantee and optimal asymptotic tracking. Furthermore, the proposed scheme employs the experience replay technique for data-driven learning, which is data efficient and relaxes the persistence of excitation requirement in learning the feedback control parameters. It is shown that the learned feedback control parameters converge to the optimal solution of the Riccati equation and the feedforward control parameters converge to the solution of the Sylvester equation. Simulation studies on two practical systems have been carried out to show the effectiveness of the proposed scheme.

引用

页码：1825 / 1842

页数：18

共 50 条

[21] Output Tracking Control of Discrete-Time Nonlinear Systems by Output Feedback Passivity based Adaptive PID
Mizumoto, Ikuro
Takagi, Taro
2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 6954 - 6959
[22] Output Tracking Control of Discrete-Time Nonlinear Systems by Adaptive PID based on Output Feedback Passivity
Mizumoto, Ikuro
Takagi, Taro
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2584 - 2589
[23] Robust output regulation of discrete-time linear systems by quantized output feedback control
Liu, Tao
Huang, Jie
AUTOMATICA, 2019, 107 : 587 - 590
[24] Observer-based output feedback control of discrete-time linear systems with input and output delays
Zhou, Bin
INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (11) : 2252 - 2272
[25] Optimal tracking control for discrete-time modal persistent dwell time switched systems based on Q-learning
Zhang, Xuewen
Wang, Yun
Xia, Jianwei
Li, Feng
Shen, Hao
OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (06): : 3327 - 3341
[26] Linear quadratic optimal control method based on output feedback inverse reinforcement Q-learning
Liu, Wen
Fan, Jia-Lu
Xue, Wen-Qian
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (08): : 1469 - 1479
[27] Output Feedback Control for a Class of Switching Discrete-Time Linear Systems
Alessandri, A.
Bedouhene, F.
Kheloufi, H.
Zemouche, A.
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1533 - 1538
[28] Approximate optimal output tracking control for nonlinear discrete-time systems
Tang, Gong-You
Liu, Yi-Min
Zhang, Yong
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2010, 27 (03): : 400 - 405
[29] Quantized H∞ output feedback control for linear discrete-time systems
Lu, Renquan
Zhou, Xingxing
Wu, Fang
Xue, Anke
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2013, 350 (08): : 2096 - 2108
[30] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
Rizvi, Syed Ali Asad
Lin, Zongli
AUTOMATICA, 2018, 95 : 213 - 221

← 1 2 3 4 5 →