On-Policy Data-Driven Linear Quadratic Regulator via Model Reference Adaptive Reinforcement Learning

被引:0
|
作者
Borghesi, Marco [1 ]
Bosso, Alessandro [1 ]
Notarstefano, Giuseppe [1 ]
机构
[1] Alma Mater Studiorum Univ, Dept Elect Elect & Informat Engn, Bologna, Italy
关键词
SYSTEMS;
D O I
10.1109/CDC49753.2023.10383516
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we address a data-driven linear quadratic optimal control problem in which the regulator design is performed on-policy by resorting to approaches from reinforcement learning and model reference adaptive control. In particular, a continuous-time identifier of the value function is used to generate online a reference model for the adaptive stabilizer. By introducing a suitably selected dithering signal, the resulting policy is shown to achieve asymptotic convergence to the optimal gain while the controlled plant reaches asymptotically the behavior of the optimal closed-loop system.
引用
收藏
页码:32 / 37
页数:6
相关论文
共 50 条
  • [21] Secure Linear Quadratic Regulator Using Sparse Model-Free Reinforcement Learning
    Kiumarsi, Bahare
    Basar, Tamer
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3641 - 3647
  • [22] Reinforcement Learning for Adaptive Periodic Linear Quadratic Control
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3322 - 3327
  • [23] Learning the model-free linear quadratic regulator via random search
    Mohammadi, Hesameddin
    Soltanolkotabi, Mahdi
    Jovanovic, Mihailo R.
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 531 - 539
  • [24] A Data-Driven Pandemic Simulator with Reinforcement Learning
    Zhang, Yuting
    Ma, Biyang
    Cao, Langcai
    Liu, Yanyu
    ELECTRONICS, 2024, 13 (13)
  • [25] Integrating On-policy Reinforcement Learning with Multi-agent Techniques for Adaptive Service Composition
    Wang, Hongbing
    Chen, Xin
    Wu, Qin
    Yu, Qi
    Zheng, Zibin
    Bouguettaya, Athman
    SERVICE-ORIENTED COMPUTING, ICSOC 2014, 2014, 8831 : 154 - 168
  • [26] Adaptive average arterial pressure control by multi-agent on-policy reinforcement learning
    Hong, Xiaofeng
    Ayadi, Walid
    Alattas, Khalid A.
    Mohammadzadeh, Ardashir
    Salimi, Mohamad
    Zhang, Chunwei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [27] A Novel Adaptive Iterative Learning Control via Data-driven Approach
    Chi Ronghu
    Liu Xiaohe
    Hou Zhongsheng
    Chien Chiang-Ju
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3147 - 3151
  • [28] Adaptive Robust Control for Uncertain Systems via Data-Driven Learning
    Zhao, Jun
    Zeng, Qingliang
    JOURNAL OF SENSORS, 2022, 2022
  • [29] Data-driven torque and pitch control of wind turbines via reinforcement learning
    Xie, Jingjie
    Dong, Hongyang
    Zhao, Xiaowei
    RENEWABLE ENERGY, 2023, 215
  • [30] Data-driven Haptic Modeling of Plastic Flow via Inverse Reinforcement Learning
    Abdulali, Arsen
    Jeon, Seokhee
    2021 IEEE WORLD HAPTICS CONFERENCE (WHC), 2021, : 115 - 120