Online Solution to the Linear Quadratic Tracking Problem of Continuous-time Systems using Reinforcement Learning

Cited by: 0

Authors:

Modares, Hamidreza [1]

Lewis, Frank L. [1]

Affiliation:

[1] Univ Texas Arlington, Res Inst, Ft Worth, TX 76118 USA

Source:

2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2013

Keywords:

ADAPTIVE OPTIMAL-CONTROL;

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

In this paper, reinforcement learning (RL) is employed to find a causal solution to the linear quadratic tracker (LQT) for continuous-time systems online in real time. Although several RL techniques have been developed in the literature to solve the LQ regulator, to our knowledge there is no rigorous result on using RL to solve the LQ tracker. This is mainly because the tracker control requires a feedforward term that must be computed in a noncausal manner, backwards in time. To deal with this noncausality problem, an augmented system composed of the original system and the command generator dynamics is constructed, and an augmented LQT algebraic Riccati equation is derived for solving the LQT problem. In this formulation, one can apply RL techniques to solve the LQT problem, computing the feedforward term and the feedback term simultaneously online in real time. The convergence of the proposed online algorithms to the optimal control solution is verified. To show the efficiency of the proposed approach, a simulation example is provided.
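
The augmented formulation described in the abstract can be sketched as follows. This is one common way to write it, not necessarily the notation of the paper: the symbols $A$, $B$, $C$, $F$, $T$, $B_1$, $C_1$, $Q$, $R$, $P$ and in particular the discount factor $\gamma$ are assumptions introduced here for illustration, since the abstract states none of them.

```latex
% Assumed plant and command generator (notation is illustrative)
\begin{align}
  \dot{x} &= A x + B u, \qquad y = C x, \\
  \dot{y}_d &= F y_d .
\end{align}

% Augmented state X = [x^T  y_d^T]^T
\begin{align}
  \dot{X} &= T X + B_1 u, \qquad
  T = \begin{bmatrix} A & 0 \\ 0 & F \end{bmatrix}, \quad
  B_1 = \begin{bmatrix} B \\ 0 \end{bmatrix}.
\end{align}

% Assumed discounted tracking cost, with tracking error C_1 X = C x - y_d
\begin{align}
  V(X(t)) &= \int_t^{\infty} e^{-\gamma(\tau - t)}
     \left[ X^{\top} C_1^{\top} Q\, C_1 X + u^{\top} R\, u \right] d\tau,
  \qquad C_1 = \begin{bmatrix} C & -I \end{bmatrix}.
\end{align}

% With V = X^T P X, the augmented algebraic Riccati equation and control are
\begin{align}
  0 &= T^{\top} P + P T - \gamma P + C_1^{\top} Q\, C_1
       - P B_1 R^{-1} B_1^{\top} P, \\
  u &= -R^{-1} B_1^{\top} P X .
\end{align}
```

Under these assumptions the single gain $-R^{-1} B_1^{\top} P$ acts on both $x$ and $y_d$, so the feedback and feedforward parts of the control come from one Riccati solution $P$ instead of a separate backward-in-time computation.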

Pages: 3851 - 3856

Page count: 6

50 entries in total

[1] Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
Modares, Hamidreza
Lewis, Frank L.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (11) : 3051 - 3056
[2] Output Feedback Reinforcement Learning Control for the Continuous-Time Linear Quadratic Regulator Problem
Rizvi, Syed Ali Asad
Lin, Zongli
[J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3417 - 3422
[3] Online Reinforcement Learning in Stochastic Continuous-Time Systems
Faradonbeh, Mohamad Kazem Shirani
Faradonbeh, Mohamad Sadegh Shirani
[J]. THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195 : 612 - 656
[4] Reinforcement Learning-Based Linear Quadratic Regulation of Continuous-Time Systems Using Dynamic Output Feedback
Rizvi, Syed Ali Asad
Lin, Zongli
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) : 4670 - 4679
[5] Infinite-time robust optimal output tracking of continuous-time linear systems using undiscounted reinforcement learning
Amirparast, Ali
Sani, S. Kamal Hosseini
[J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (14) : 2933 - 2951
[6] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
Tao Bian
Zhong-Ping Jiang
[J]. IEEE/CAA Journal of Automatica Sinica, 2019, 6 (02) : 433 - 440
[7] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
Bian, Tao
Jiang, Zhong-Ping
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (02) : 433 - 440
[8] Solution of the linear quadratic regulator problem of black box linear systems using reinforcement learning
Perrusquia, Adolfo
[J]. INFORMATION SCIENCES, 2022, 595 : 364 - 377
[9] OPTIMAL SCHEDULING OF ENTROPY REGULARIZER FOR CONTINUOUS-TIME LINEAR-QUADRATIC REINFORCEMENT LEARNING
Szpruch, Lukasz
Treetanthiploet, Tanut
Zhang, Yufei
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2024, 62 (01) : 135 - 166
[10] Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics
Zhu, Yuanheng
Zhao, Dongbin
Li, Xiangjun
[J]. IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12) : 1339 - 1347
