Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems

Cited by: 14
Authors
Jiang, Huaiyuan [1]
Zhou, Bin [1]
Affiliations
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, POB 416, Harbin 150001, Peoples R China
Keywords
Adaptive dynamic programming; Policy iteration; Unknown systems; Optimal control; Data-driven control; DESIGN;
DOI
10.1016/j.automatica.2021.110058
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, a bias-policy iteration method for solving the data-driven optimal control problem of unknown continuous-time linear systems is proposed. First, a model-based bias-policy iteration method is given and its convergence is rigorously proved. A data-driven implementation of the proposed method is then introduced that does not use information about the system matrices. The relationship between the proposed method and the existing policy iteration and value iteration methods is also analyzed. Compared with the existing policy iteration method, the most significant advantage of the proposed method is that, by adding a bias parameter, the condition on the initial admissible controllers can be further relaxed. Simulation examples verify the effectiveness of the proposed bias-policy iteration method. (C) 2021 Elsevier Ltd. All rights reserved.
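For orientation, the following is a minimal sketch of the existing (classical, model-based) policy iteration that the abstract compares against. It is not the bias-policy iteration of the paper, whose update rule is not given in this record. The scalar system dx/dt = a*x + b*u, the cost weights q and r, and the function names are all assumptions chosen for illustration. The sketch shows why the classical method needs an admissible (stabilizing) initial gain: policy evaluation is only meaningful when a - b*k is negative.

```python
import math


def policy_iteration_scalar(a, b, q, r, k0, iters=20):
    """Classical policy iteration for the scalar system dx/dt = a*x + b*u
    with cost integral of q*x^2 + r*u^2 and feedback u = -k*x.

    Hypothetical helper for illustration only; the gain k0 must be admissible.
    """
    k = k0
    history = []
    for _ in range(iters):
        closed_loop = a - b * k
        if closed_loop >= 0:
            # The cost of a non-stabilizing gain is infinite, so the
            # evaluation step below would return a meaningless value.
            raise ValueError("gain is not admissible: a - b*k must be negative")
        # Policy evaluation: scalar Lyapunov equation
        # 2*(a - b*k)*p + q + r*k^2 = 0
        p = (q + r * k * k) / (-2.0 * closed_loop)
        # Policy improvement: k = b*p/r
        k = b * p / r
        history.append(p)
    return p, k, history


def riccati_scalar(a, b, q, r):
    """Positive root of the scalar Riccati equation 2*a*p - (b^2/r)*p^2 + q = 0."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    # Open-loop unstable plant (a > 0); k0 = 3 gives a - b*k0 = -2 < 0.
    p, k, history = policy_iteration_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
    print(p, riccati_scalar(1.0, 1.0, 1.0, 1.0))
```

With an admissible starting gain the iterates decrease toward the Riccati solution; with a non-admissible one (for example k0 = 0.5 for the plant above) the sketch raises an error, which is the restriction the abstract says the bias parameter relaxes.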
Pages: 12
Related papers
50 in total
  • [1] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
    Li, Xiang
    Jiang, Huaiyuan
    Zhou, Bin
    [J]. SYSTEMS & CONTROL LETTERS, 2024, 189
  • [2] Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
    Wei, Qinglai
    Zhou, Tianmin
    Lu, Jingwei
    Liu, Yu
    Su, Shuai
    Xiao, Jun
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (10) : 6375 - 6387
  • [3] Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems
    Jiang, Huaiyuan
    Zhou, Bin
    Duan, Guang-Ren
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3291 - 3301
  • [4] Adaptive optimal control for continuous-time linear systems based on policy iteration
    Vrabie, D.
    Pastravanu, O.
    Abu-Khalaf, M.
    Lewis, F. L.
    [J]. AUTOMATICA, 2009, 45 (02) : 477 - 484
  • [5] Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems
    Jiang, Huaiyuan
    Zhou, Bin
    Duan, Guang-Ren
    [J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (12) : 7149 - 7173
  • [6] Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming
    Xie, Kedi
    Zheng, Yiwei
    Jiang, Yi
    Lan, Weiyao
    Yu, Xiao
    [J]. AUTOMATICA, 2024, 163
  • [7] On Generalized Policy Iteration for Continuous-Time Linear Systems
    Lee, Jae Young
    Chun, Tae Yoon
    Park, Jin Bae
    Choi, Yoon Ho
    [J]. 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011 : 1722 - 1728
  • [8] Policy iteration for continuous-time systems with unknown internal dynamics
    Vrabie, D.
    Pastravanu, O.
    Lewis, F. L.
    [J]. 2007 MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-4, 2007 : 34 - +
  • [9] Explorized policy iteration for continuous-time linear systems
    Chun, Tae Yoon
    Choi, Yoon Ho
    Park, Jin Bae
    [J]. Transactions of the Korean Institute of Electrical Engineers, 2012, 61 (03) : 451 - 458
  • [10] Adaptive optimal output regulation of unknown linear continuous-time systems by dynamic output feedback and value iteration
    Xie, Kedi
    Zheng, Yiwei
    Lan, Weiyao
    Yu, Xiao
    [J]. CONTROL ENGINEERING PRACTICE, 2023, 141