On-Policy Data-Driven Linear Quadratic Regulator via Model Reference Adaptive Reinforcement Learning

被引：0

作者：

Borghesi, Marco ^{[1
]}

Bosso, Alessandro ^{[1
]}

Notarstefano, Giuseppe ^{[1
]}

机构：

[1] Alma Mater Studiorum Univ, Dept Elect Elect & Informat Engn, Bologna, Italy

来源：

2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年

关键词：

SYSTEMS;

D O I：

10.1109/CDC49753.2023.10383516

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we address a data-driven linear quadratic optimal control problem in which the regulator design is performed on-policy by resorting to approaches from reinforcement learning and model reference adaptive control. In particular, a continuous-time identifier of the value function is used to generate online a reference model for the adaptive stabilizer. By introducing a suitably selected dithering signal, the resulting policy is shown to achieve asymptotic convergence to the optimal gain while the controlled plant reaches asymptotically the behavior of the optimal closed-loop system.

引用

页码：32 / 37

页数：6

共 50 条

[21] Secure Linear Quadratic Regulator Using Sparse Model-Free Reinforcement Learning
Kiumarsi, Bahare
Basar, Tamer
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3641 - 3647
[22] Reinforcement Learning for Adaptive Periodic Linear Quadratic Control
Pang, Bo
Jiang, Zhong-Ping
Mareels, Iven
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 3322 - 3327
[23] Learning the model-free linear quadratic regulator via random search
Mohammadi, Hesameddin
Soltanolkotabi, Mahdi
Jovanovic, Mihailo R.
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 531 - 539
[24] A Data-Driven Pandemic Simulator with Reinforcement Learning
Zhang, Yuting
Ma, Biyang
Cao, Langcai
Liu, Yanyu
ELECTRONICS, 2024, 13 (13)
[25] Integrating On-policy Reinforcement Learning with Multi-agent Techniques for Adaptive Service Composition
Wang, Hongbing
Chen, Xin
Wu, Qin
Yu, Qi
Zheng, Zibin
Bouguettaya, Athman
SERVICE-ORIENTED COMPUTING, ICSOC 2014, 2014, 8831 : 154 - 168
[26] Adaptive average arterial pressure control by multi-agent on-policy reinforcement learning
Hong, Xiaofeng
Ayadi, Walid
Alattas, Khalid A.
Mohammadzadeh, Ardashir
Salimi, Mohamad
Zhang, Chunwei
SCIENTIFIC REPORTS, 2025, 15 (01):
[27] A Novel Adaptive Iterative Learning Control via Data-driven Approach
Chi Ronghu
Liu Xiaohe
Hou Zhongsheng
Chien Chiang-Ju
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 3147 - 3151
[28] Adaptive Robust Control for Uncertain Systems via Data-Driven Learning
Zhao, Jun
Zeng, Qingliang
JOURNAL OF SENSORS, 2022, 2022
[29] Data-driven torque and pitch control of wind turbines via reinforcement learning
Xie, Jingjie
Dong, Hongyang
Zhao, Xiaowei
RENEWABLE ENERGY, 2023, 215
[30] Data-driven Haptic Modeling of Plastic Flow via Inverse Reinforcement Learning
Abdulali, Arsen
Jeon, Seokhee
2021 IEEE WORLD HAPTICS CONFERENCE (WHC), 2021, : 115 - 120

← 1 2 3 4 5 →