Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems

被引：14

作者：

Jiang, Huaiyuan ^{[1
]}

Zhou, Bin ^{[1
]}

机构：

[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, POB 416, Harbin 150001, Peoples R China

来源：

AUTOMATICA | 2022年 / 136卷

关键词：

Adaptive dynamic programming; Policy iteration; Unknown systems; Optimal control; Data-driven control; DESIGN;

D O I：

10.1016/j.automatica.2021.110058

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a bias-policy iteration method for solving the data-driven optimal control problem of unknown continuous-time linear systems is proposed. Firstly, a model-based bias-policy iteration method is given and its convergence is rigorously proved. Then the data-driven implementation for the proposed method is then introduced without using the information of the system matrices. The relationship between the proposed method and the existing policy iteration method and value iteration method is also analyzed. Compared with the existing policy iteration method, the most significant advantage of the proposed method is that, by adding a bias parameter, the condition of the initial admissible controllers can be further relaxed. Simulation examples verify the effectiveness of the proposed bias-policy iteration method. (C) 2021 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[1] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
Li, Xiang
Jiang, Huaiyuan
Zhou, Bin
[J]. SYSTEMS & CONTROL LETTERS, 2024, 189
[2] Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
Wei, Qinglai
Zhou, Tianmin
Lu, Jingwei
Liu, Yu
Su, Shuai
Xiao, Jun
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (10): : 6375 - 6387
[3] Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems
Jiang, Huaiyuan
Zhou, Bin
Duan, Guang-Ren
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3291 - 3301
[4] Adaptive optimal control for continuous-time linear systems based on policy iteration
Vrabie, D.
Pastravanu, O.
Abu-Khalaf, M.
Lewis, F. L.
[J]. AUTOMATICA, 2009, 45 (02) : 477 - 484
[5] Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems
Jiang, Huaiyuan
Zhou, Bin
Duan, Guang-Ren
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (12) : 7149 - 7173
[6] Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming☆
Xie, Kedi
Zheng, Yiwei
Jiang, Yi
Lan, Weiyao
Yu, Xiao
[J]. AUTOMATICA, 2024, 163
[7] On Generalized Policy Iteration for Continuous-Time Linear Systems
Lee, Jae Young
Chun, Tae Yoon
Park, Jin Bae
Choi, Yoon Ho
[J]. 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1722 - 1728
[8] Policy iteration for continuous-time systems with unknown internal dynamics
Vrabie, D.
Pastravanu, O.
Lewis, F. L.
[J]. 2007 MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-4, 2007, : 34 - +
[9] Explorized policy iteration for continuous-time linear systems
Chun, Tae Yoon
Choi, Yoon Ho
Park, Jin Bae
[J]. Transactions of the Korean Institute of Electrical Engineers, 2012, 61 (03): : 451 - 458
[10] Adaptive optimal output regulation of unknown linear continuous-time systems by dynamic output feedback and value iteration
Xie, Kedi
Zheng, Yiwei
Lan, Weiyao
Yu, Xiao
[J]. CONTROL ENGINEERING PRACTICE, 2023, 141

← 1 2 3 4 5 →