Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems

被引:14
|
作者
Jiang, Huaiyuan [1 ]
Zhou, Bin [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, POB 416, Harbin 150001, Peoples R China
关键词
Adaptive dynamic programming; Policy iteration; Unknown systems; Optimal control; Data-driven control; DESIGN;
D O I
10.1016/j.automatica.2021.110058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a bias-policy iteration method for solving the data-driven optimal control problem of unknown continuous-time linear systems is proposed. Firstly, a model-based bias-policy iteration method is given and its convergence is rigorously proved. Then the data-driven implementation for the proposed method is then introduced without using the information of the system matrices. The relationship between the proposed method and the existing policy iteration method and value iteration method is also analyzed. Compared with the existing policy iteration method, the most significant advantage of the proposed method is that, by adding a bias parameter, the condition of the initial admissible controllers can be further relaxed. Simulation examples verify the effectiveness of the proposed bias-policy iteration method. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Input-Derivative-Constrained Approximate Dynamic Programming For Unknown Continuous-Time Linear Systems
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    [J]. ISIE: 2009 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, 2009, : 1137 - 1142
  • [22] On approximate policy iteration for continuous-time systems
    Wernrud, Andreas
    Rantzer, Anders
    [J]. 2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1453 - 1458
  • [23] Generalized Policy Iteration for Continuous-Time Systems
    Vrabie, Draguna
    Lewis, Frank L.
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 2677 - 2684
  • [24] Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (11) : 2917 - 2929
  • [25] Finite horizon optimal tracking control of partially unknown linear continuous-time systems using policy iteration
    Li, Chao
    Liu, Derong
    Li, Hongliang
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2015, 9 (12): : 1791 - 1801
  • [26] Adaptive output-feedback optimal control for continuous-time linear systems based on adaptive dynamic programming approach
    Shi, Zhan
    Wang, Zhanshan
    [J]. NEUROCOMPUTING, 2021, 438 : 334 - 344
  • [27] Linear-Like Policy Iteration Based Optimal Control for Continuous-Time Nonlinear Systems
    Tahirovic, Adnan
    Astolfi, Alessandro
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (10) : 5837 - 5849
  • [28] Adaptive dynamic programming based event-triggered control for unknown continuous-time nonlinear systems with input constraints
    Xue, Shan
    Luo, Biao
    Liu, Derong
    Li, Yueheng
    [J]. NEUROCOMPUTING, 2020, 396 (396) : 191 - 200
  • [29] Event-triggered control based on adaptive dynamic programming for continuous-time nonlinear systems with completely unknown dynamics
    Shi, Jing
    Yue, Dong
    Yang, Yang
    Hu, Songlin
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2035 - 2040
  • [30] Optimal Control for Continuous-time Nonlinear Systems based on a Linear-like Policy Iteration
    Tahirovic, Adnan
    Astolfi, Alessandro
    [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5238 - 5243