An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引:15
|
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;
D O I
10.1002/rnc.4514
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.
引用
收藏
页码:2660 / 2672
页数:13
相关论文
共 50 条
  • [1] Finite gain stabilization of discrete-time linear systems subject to actuator saturation
    Bao, XY
    Lin, ZL
    Sontag, ED
    AUTOMATICA, 2000, 36 (02) : 269 - 277
  • [2] Stabilization with decay rate analysis for discrete-time linear systems subject to actuator saturation
    Ma, Yong-Mei
    Yang, Guang-Hong
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 1887 - 1892
  • [3] Global Stabilization of Discrete-Time Linear Systems Subject to Input Saturation and Time Delay
    Yang, Xuefei
    Zhou, Bin
    Mazenc, Frederic
    Lam, James
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (03) : 1345 - 1352
  • [4] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation"
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    Sannuti, Peddapullaiah
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 3808 - 3812
  • [5] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    Sannuti, Peddapullaiah
    AUTOMATICA, 2012, 48 (05) : 699 - 711
  • [6] Analysis and design for discrete-time linear systems subject to actuator saturation
    Hu, TS
    Lin, ZL
    Chen, BM
    SYSTEMS & CONTROL LETTERS, 2002, 45 (02) : 97 - 112
  • [7] Stability analysis for linear discrete-time systems subject to actuator saturation
    Yongmei MA 1
    2.College of Information Science and Engineering
    3.Key Laboratory of Integrated Automation of Process Industry (Ministry of Education)
    Control Theory and Technology, 2010, 8 (02) : 245 - 248
  • [8] Stability analysis for linear discrete-time systems subject to actuator saturation
    Ma Y.
    Yang G.
    Journal of Control Theory and Applications, 2010, 8 (02): : 245 - 248
  • [9] Analysis and design for discrete-time linear systems subject to actuator saturation
    Hu, TS
    Lin, ZL
    Chen, BM
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4675 - 4680
  • [10] Performance analysis for linear discrete-time systems subject to actuator saturation
    Ma, Yong-Mei
    Yang, Guang-Hong
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 3608 - 3613