An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引:16
|
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;
D O I
10.1002/rnc.4514
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.
引用
收藏
页码:2660 / 2672
页数:13
相关论文
共 50 条
  • [21] Some new results on finite gain lp stabilization of discrete-time linear systems subject to actuator saturation
    Bao, XY
    Lin, ZL
    Sontag, ED
    PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 4628 - 4629
  • [22] Inite gain lP stabilization of discrete-time linear systems subject to actuator saturation:: The case of p=1
    Chitour, Y
    Lin, ZL
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (12) : 2196 - 2198
  • [23] Output feedback fault-tolerant Q-learning for discrete-time linear systems with actuator faults
    Rafiee, Sajad
    Kankashvar, Mohammadrasoul
    Bolandi, Hossein
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [24] An iterative Q-learning based global consensus of discrete-time saturated multi-agent systems
    Long, Mingkang
    Su, Housheng
    Wang, Xiaoling
    Jiang, Guo-Ping
    Wang, Xiaofan
    CHAOS, 2019, 29 (10)
  • [25] Robust stabilization of switched discrete-time systems with actuator saturation
    Yongmei MA 1
    2.College of Information Science and Engineering
    3.Key Laboratory of Integrated Automation of Process Industry
    JournalofControlTheoryandApplications, 2009, 7 (04) : 454 - 458
  • [26] Robust stabilization of switched discrete-time Systems with actuator saturation
    Ma Y.
    Yang G.
    Guan W.
    Journal of Control Theory and Applications, 2009, 7 (04): : 454 - 458
  • [27] Exponential Estimates and Stabilization of Discrete-Time Singular Time-Delay Systems Subject to Actuator Saturation
    Lin, Jinxing
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2012, 2012
  • [28] Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems
    Ikemoto, Junya
    Ushio, Toshimitsu
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2021, 12 (04): : 738 - 757
  • [29] Semi-global stabilization of discrete-time linear systems with unsymmetrical saturation
    Wu, Wen-Juan
    Liu, Hai-Tao
    ISA TRANSACTIONS, 2019, 92 : 134 - 144
  • [30] STABILIZATION OF LINEAR DISCRETE-TIME-SYSTEMS WITH ACTUATOR SATURATION
    CHOU, JH
    SYSTEMS & CONTROL LETTERS, 1991, 17 (02) : 141 - 144