Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

Cited by: 7
Authors
Cui, Leilei [1 ]
Pang, Bo [1 ]
Jiang, Zhong-Ping [1 ]
Affiliations
[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, Brooklyn, NY 11201 USA
Funding
U.S. National Science Foundation
Keywords
Optimal control; Aerospace electronics; Mathematical models; Heuristic algorithms; Delays; Trajectory; Stability criteria; Adaptive dynamic programming (ADP); linear time-delay systems; optimal control; policy iteration (PI); MULTIAGENT SYSTEMS; RICCATI-EQUATIONS; REGULATOR; CONSENSUS;
DOI
10.1109/TAC.2023.3273786
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations. A crucial strategy is to take advantage of recent developments in reinforcement learning and adaptive dynamic programming and develop novel methods to learn adaptive optimal controllers from finite samples of input and state data. In this article, the data-driven policy iteration (PI) is proposed to solve the infinite-dimensional algebraic Riccati equation iteratively in the absence of exact model knowledge. Interestingly, the proposed recursive PI algorithm is new in the present context of continuous-time time-delay systems, even when the model knowledge is assumed known. The efficacy of the proposed learning-based control methods is validated by means of practical applications arising from metal cutting and autonomous driving.
Pages: 629-636
Page count: 8
Related papers
50 in total
  • [31] Adaptive H infinity control for a class of linear time-delay systems with input delay
    Department of Automation, Southeast University, Nanjing 210096, China
    Xitong Gongcheng Lilun yu Shijian, 2006, 3 (61-67):
  • [32] An Approach on Adaptive Time-delay Estimate and Compensation Control in Internet-based Control Systems
    Xiong, Naixue
    Li, Hongyan
    Kim, Tai-hoon
    Yang, Laurence T.
    FGCN: PROCEEDINGS OF THE 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING, VOLS 1 AND 2, 2008, : 114 - +
  • [33] Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    Vrabie, D.
    Lewis, F. L.
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 73 - 79
  • [34] Stochastic linear quadratic optimal control for continuous-time systems based on policy iteration
    College of Information Science and Engineering, Northeastern University, Shenyang 110004, China
    Unknown, 110034, China
    Kongzhi yu Juece Control Decis, 9 (1674-1678):
  • [35] Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach
    He, Shuping
    Fang, Haiyang
    Zhang, Maoguang
    Liu, Fei
    Ding, Zhengtao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 549 - 558
  • [36] Optimal Control Approach for Robust Control Design of Uncertain Time-delay Systems
    Lin, Yu-Chen
    Lin, Chun-Liang
    ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, : 53 - 57
  • [37] Value Iteration and Adaptive Optimal Control for Linear Continuous-time Systems
    Bian, Tao
    Jiang, Zhong-Ping
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 53 - 58
  • [38] Adaptive Predictive Control of Time-Delay Systems
    Bobal, Vladimir
    Kubalcik, Marek
    Dostal, Petr
    Matejicek, Jakub
    NOSTRADAMUS: MODERN METHODS OF PREDICTION, MODELING AND ANALYSIS OF NONLINEAR SYSTEMS, 2013, 192 : 61 - 72
  • [39] COMPUTATION OF OPTIMAL CONTROL FOR TIME-DELAY SYSTEMS
    AGGARWAL, JK
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1970, AC15 (06) : 683 - &
  • [40] Adaptive control of a class of time-delay systems
    Evesque, S
    Annaswamy, AM
    Niculescu, S
    Dowling, AP
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2003, 125 (02): : 186 - 193