Experience replay-based output feedback Q-learning scheme for optimal output tracking control of discrete-time linear systems

被引:6
|
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
discounting factor; optimal tracking; output feedback; Q-learning; ADAPTIVE OPTIMAL-CONTROL; TRAJECTORY TRACKING; DESIGN; LEADER; MRAC;
D O I
10.1002/acs.2981
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on solving the adaptive optimal tracking control problem for discrete-time linear systems with unknown system dynamics using output feedback. A Q-learning-based optimal adaptive control scheme is presented to learn the feedback and feedforward control parameters of the optimal tracking control law. The optimal feedback parameters are learned using the proposed output feedback Q-learning Bellman equation, whereas the estimation of the optimal feedforward control parameters is achieved using an adaptive algorithm that guarantees convergence to zero of the tracking error. The proposed method has the advantage that it is not affected by the exploration noise bias problem and does not require a discounting factor, relieving the two bottlenecks in the past works in achieving stability guarantee and optimal asymptotic tracking. Furthermore, the proposed scheme employs the experience replay technique for data-driven learning, which is data efficient and relaxes the persistence of excitation requirement in learning the feedback control parameters. It is shown that the learned feedback control parameters converge to the optimal solution of the Riccati equation and the feedforward control parameters converge to the solution of the Sylvester equation. Simulation studies on two practical systems have been carried out to show the effectiveness of the proposed scheme.
引用
收藏
页码:1825 / 1842
页数:18
相关论文
共 50 条
  • [21] Output Tracking Control of Discrete-Time Nonlinear Systems by Output Feedback Passivity based Adaptive PID
    Mizumoto, Ikuro
    Takagi, Taro
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 6954 - 6959
  • [22] Output Tracking Control of Discrete-Time Nonlinear Systems by Adaptive PID based on Output Feedback Passivity
    Mizumoto, Ikuro
    Takagi, Taro
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2584 - 2589
  • [23] Robust output regulation of discrete-time linear systems by quantized output feedback control
    Liu, Tao
    Huang, Jie
    AUTOMATICA, 2019, 107 : 587 - 590
  • [24] Observer-based output feedback control of discrete-time linear systems with input and output delays
    Zhou, Bin
    INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (11) : 2252 - 2272
  • [25] Optimal tracking control for discrete-time modal persistent dwell time switched systems based on Q-learning
    Zhang, Xuewen
    Wang, Yun
    Xia, Jianwei
    Li, Feng
    Shen, Hao
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (06): : 3327 - 3341
  • [26] Linear quadratic optimal control method based on output feedback inverse reinforcement Q-learning
    Liu, Wen
    Fan, Jia-Lu
    Xue, Wen-Qian
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2024, 41 (08): : 1469 - 1479
  • [27] Output Feedback Control for a Class of Switching Discrete-Time Linear Systems
    Alessandri, A.
    Bedouhene, F.
    Kheloufi, H.
    Zemouche, A.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1533 - 1538
  • [28] Approximate optimal output tracking control for nonlinear discrete-time systems
    Tang, Gong-You
    Liu, Yi-Min
    Zhang, Yong
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2010, 27 (03): : 400 - 405
  • [29] Quantized H∞ output feedback control for linear discrete-time systems
    Lu, Renquan
    Zhou, Xingxing
    Wu, Fang
    Xue, Anke
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2013, 350 (08): : 2096 - 2108
  • [30] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
    Rizvi, Syed Ali Asad
    Lin, Zongli
    AUTOMATICA, 2018, 95 : 213 - 221