Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

被引：7

作者：

Cui, Leilei ^{[1
]}

Pang, Bo ^{[1
]}

Jiang, Zhong-Ping ^{[1
]}

机构：

[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, Brooklyn, NY 11201 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2024年 / 69卷 / 01期

基金：

美国国家科学基金会;

关键词：

Optimal control; Aerospace electronics; Mathematical models; Heuristic algorithms; Delays; Trajectory; Stability criteria; Adaptive dynamic programming (ADP); linear time-delay systems; optimal control; policy iteration (PI); MULTIAGENT SYSTEMS; RICCATI-EQUATIONS; REGULATOR; CONSENSUS;

D O I：

10.1109/TAC.2023.3273786

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations. A crucial strategy is to take advantage of recent developments in reinforcement learning and adaptive dynamic programming and develop novel methods to learn adaptive optimal controllers from finite samples of input and state data. In this article, the data-driven policy iteration (PI) is proposed to solve the infinite-dimensional algebraic Riccati equation iteratively in the absence of exact model knowledge. Interestingly, the proposed recursive PI algorithm is new in the present context of continuous-time time-delay systems, even when the model knowledge is assumed known. The efficacy of the proposed learning-based control methods is validated by means of practical applications arising from metal cutting and autonomous driving.

引用

页码：629 / 636

页数：8

共 50 条

[41] Robust control of uncertain time-delay systems - A minimax optimal approach
Moheimani, SOR
Savkin, AV
Petersen, IR
PROCEEDINGS OF THE 35TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1996, : 1362 - 1367
[42] Adaptive predictive control of time-delay systems
Bobal, Vladimir
Kubalcik, Marek
Dostal, Petr
Matejicek, Jakub
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2013, 66 (02) : 165 - 176
[43] OPTIMAL CONTROL APPROXIMATIONS FOR TIME-DELAY SYSTEMS
HESS, RA
AIAA JOURNAL, 1972, 10 (11) : 1536 - +
[44] Learning-Based Adaptive Optimal Output Regulation of Discrete-Time Linear Systems
Chakraborty, Sayan
Gao, Weinan
Cui, Leilei
Lewis, Frank L.
Jiang, Zhong-Ping
IFAC PAPERSONLINE, 2023, 56 (02): : 10283 - 10288
[45] Adaptive integral control of time-delay systems
Logemann, H
Townley, S
IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 1997, 144 (06): : 531 - 536
[46] BMI Approach to Cooperative Control of Linear Systems with Uncertainty and Time-delay
Deng Xiaofei
Nian Xiaohong
Huang Guangya
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 1480 - 1485
[47] On　the　H∞　Control　for　Linear　Time-Delay　Systems
Chen Wanyi (College of Mathematics
Journal of Systems Engineering and Electronics, 1998, (03) : 23 - 28
[48] H∞ control of linear uncertain time-delay systems -: A projection approach
Suplin, V
Fridman, E
Shaked, U
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (04) : 680 - 685
[49] A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems
Wong, Wee Chin
Lee, Jay H.
OPTIMAL CONTROL APPLICATIONS & METHODS, 2010, 31 (04): : 365 - 374
[50] A descriptor system approach to H∞ control of linear time-delay systems
Fridman, E
Shaked, U
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (02) : 253 - 270

← 1 2 3 4 5 →