An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引：16

作者：

Rizvi, Syed Ali Asad ^{[1
]}

Lin, Zongli ^{[1
]}

机构：

[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2019年 / 29卷 / 09期

关键词：

actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;

D O I：

10.1002/rnc.4514

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.

引用

页码：2660 / 2672

页数：13

共 50 条

[21] Some new results on finite gain lp stabilization of discrete-time linear systems subject to actuator saturation
Bao, XY
Lin, ZL
Sontag, ED
PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 4628 - 4629
[22] Inite gain lP stabilization of discrete-time linear systems subject to actuator saturation:: The case of p=1
Chitour, Y
Lin, ZL
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (12) : 2196 - 2198
[23] Output feedback fault-tolerant Q-learning for discrete-time linear systems with actuator faults
Rafiee, Sajad
Kankashvar, Mohammadrasoul
Bolandi, Hossein
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
[24] An iterative Q-learning based global consensus of discrete-time saturated multi-agent systems
Long, Mingkang
Su, Housheng
Wang, Xiaoling
Jiang, Guo-Ping
Wang, Xiaofan
CHAOS, 2019, 29 (10)
[25] Robust stabilization of switched discrete-time systems with actuator saturation
Yongmei MA 1
2.College of Information Science and Engineering
3.Key Laboratory of Integrated Automation of Process Industry
JournalofControlTheoryandApplications, 2009, 7 (04) : 454 - 458
[26] Robust stabilization of switched discrete-time Systems with actuator saturation
Ma Y.
Yang G.
Guan W.
Journal of Control Theory and Applications, 2009, 7 (04): : 454 - 458
[27] Exponential Estimates and Stabilization of Discrete-Time Singular Time-Delay Systems Subject to Actuator Saturation
Lin, Jinxing
DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2012, 2012
[28] Continuous deep Q-learning with a simulator for stabilization of uncertain discrete-time systems
Ikemoto, Junya
Ushio, Toshimitsu
IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2021, 12 (04): : 738 - 757
[29] Semi-global stabilization of discrete-time linear systems with unsymmetrical saturation
Wu, Wen-Juan
Liu, Hai-Tao
ISA TRANSACTIONS, 2019, 92 : 134 - 144
[30] STABILIZATION OF LINEAR DISCRETE-TIME-SYSTEMS WITH ACTUATOR SATURATION
CHOU, JH
SYSTEMS & CONTROL LETTERS, 1991, 17 (02) : 141 - 144

← 1 2 3 4 5 →