An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引:16
|
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;
D O I
10.1002/rnc.4514
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.
引用
收藏
页码:2660 / 2672
页数:13
相关论文
共 50 条
  • [41] Stability analysis and stabilization for quadratic discrete-time systems with actuator saturation
    Chen, Fu
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 1462 - 1466
  • [42] Decentralized stabilization of linear time invariant systems subject to actuator saturation
    Stoorvogel, Anton A.
    Saberi, Ali
    Deliu, Ciprian
    Sannuti, Peddapullaiah
    ADVANCED STRATEGIES IN CONTROL SYSTEMS WITH INPUT AND OUTPUT CONSTRAINTS, 2007, 346 : 397 - 419
  • [43] Global consensus in homogeneous networks of discrete-time agents subject to actuator saturation
    Yang, Tao
    Meng, Ziyang
    Dimarogonas, Dimos V.
    Johansson, Karl H.
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 244 - 249
  • [44] Forwarding for discrete-time linear systems: optimality and global stabilization under input saturation
    Zoboli, Samuele
    Astolfi, Daniele
    Mattioni, Mattia
    Simpson-Porco, John W.
    van de Wouw, Nathan
    IFAC PAPERSONLINE, 2024, 58 (21): : 108 - 113
  • [45] Event-Triggered Optimal Regulation of Uncertain Linear Discrete-time Systems by using Q-learning Scheme
    Sahoo, Avimanyu
    Jagannathan, S.
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1233 - 1238
  • [46] Stabilization of Discrete-Time Linear Systems Subject to Input Saturation and Multiple Unknown Constant Delays
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (06) : 1667 - 1672
  • [47] Stabilization of discrete-time linear systems subject to input saturation and multiple unknown constant delays
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 958 - 963
  • [48] Stability analysis and antiwindup design of uncertain discrete-time switched linear systems subject to actuator saturation
    Xinquan Zhang
    Mingshun Wang
    Jun Zhao
    Journal of Control Theory and Applications, 2012, 10 (3): : 325 - 331
  • [49] The Stabilization of Switched Linear Systems Subject to Actuator Saturation
    Jing, Li
    Yang, Jin
    Du, Hongbo
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 4146 - 4148
  • [50] Q-Learning Methods for LQR Control of Completely Unknown Discrete-Time Linear Systems
    Fan, Wenwu
    Xiong, Junlin
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 5933 - 5943