A Q-learning predictive control scheme with guaranteed stability

被引:9
|
作者
Beckenbach, Lukas [1 ]
Osinenko, Pavel [1 ]
Streif, Stefan [1 ]
机构
[1] Tech Univ Chemnitz, Automat Control & Syst Dynam Lab, D-09107 Chemnitz, Germany
关键词
Predictive control; Q-Learning; Cost shaping; Nominal stability; RECEDING-HORIZON CONTROL; DISCRETE-TIME-SYSTEMS; NONLINEAR-SYSTEMS; FINITE; PERFORMANCE; MPC; STATE;
D O I
10.1016/j.ejcon.2020.03.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Model-based predictive controllers are used to tackle control tasks in which constraints on state, input or both need to be satisfied. These controllers commonly optimize a fixed finite-horizon cost, which relates to an infinite-horizon (IH) cost profile, while the resulting closed-loop under the predictive controller yields an in general suboptimal IH cost. To capture the optimal IH cost and the associated control policy, reinforcement learning methods, such as Q-learning, that approximate said cost via a parametric architec-ture can be employed. Conversely to predictive controllers, however, closed-loop stability has rarely been investigated under the approximation associated controller in explicit dependence of these parameters. It is the aim of this work to incorporate model-based Q-learning into a predictive control setup as to provide closed-loop stability in online learning, while eventually improving the performance of finite-horizon controllers. The proposed scheme provides nominal asymptotic stability and the observation was made that the suggested learning approach could in fact improve the performance against a baseline predictive controller. (c) 2020 European Control Association. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:167 / 178
页数:12
相关论文
共 50 条
  • [1] MODEL PREDICTIVE CONTROL AND MODEL PREDICTIVE Q-LEARNING FOR STRUCTURAL VIBRATION CONTROL
    Phan, Minh Q.
    Azad, Seyed Mandi B.
    ASTRODYNAMICS 2017, PTS I-IV, 2018, 162 : 17 - 36
  • [2] HARQ Control Scheme by Fuzzy Q-Learning for HSPA+
    Chung, Wen-Ching
    Chen, Ying-Yu
    Chang, Chung-Ju
    2011 IEEE 73RD VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2011,
  • [3] Q-learning of the storage function in Economic Nonlinear Model Predictive Control
    Kordabad, Arash Bahari
    Gros, Sebastien
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [4] End-point parametrization and guaranteed stability for a model predictive control scheme
    Weiland, S
    Stoorvogel, AA
    Tiagounov, AA
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4832 - 4837
  • [5] A computationally attractive nonlinear predictive control scheme with guaranteed stability for stable systems
    Chen, H
    Allgower, F
    JOURNAL OF PROCESS CONTROL, 1998, 8 (5-6) : 475 - 485
  • [6] Information Theoretic Model Predictive Q-Learning
    Bhardwaj, Mohak
    Handa, Ankur
    Fox, Dieter
    Boots, Byron
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 840 - 850
  • [7] Improving Nonlinear Model Predictive Control Laws via Implicit Q-Learning
    Alhazmi, Khalid
    Sarathy, S. Mani
    IFAC PAPERSONLINE, 2023, 56 (02): : 10027 - 10032
  • [8] Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm
    Wei, Qinglai
    Liu, Derong
    Song, Ruizhuo
    2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 125 - 130
  • [9] Pricing Scheme based Nash Q-Learning Load Control in Smart Grids
    Li, Xin
    Qin, Xiaoning
    Zang, Chuanzhi
    Che, Weiwei
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 5198 - 5202
  • [10] A quasi-infinite horizon nonlinear model predictive control scheme with guaranteed stability
    Chen, H
    Allgower, F
    AUTOMATICA, 1998, 34 (10) : 1205 - 1217