Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems

被引：14

作者：

Jiang, Huaiyuan ^{[1
]}

Zhou, Bin ^{[1
]}

机构：

[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, POB 416, Harbin 150001, Peoples R China

来源：

AUTOMATICA | 2022年 / 136卷

关键词：

Adaptive dynamic programming; Policy iteration; Unknown systems; Optimal control; Data-driven control; DESIGN;

D O I：

10.1016/j.automatica.2021.110058

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a bias-policy iteration method for solving the data-driven optimal control problem of unknown continuous-time linear systems is proposed. Firstly, a model-based bias-policy iteration method is given and its convergence is rigorously proved. Then the data-driven implementation for the proposed method is then introduced without using the information of the system matrices. The relationship between the proposed method and the existing policy iteration method and value iteration method is also analyzed. Compared with the existing policy iteration method, the most significant advantage of the proposed method is that, by adding a bias parameter, the condition of the initial admissible controllers can be further relaxed. Simulation examples verify the effectiveness of the proposed bias-policy iteration method. (C) 2021 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[21] Input-Derivative-Constrained Approximate Dynamic Programming For Unknown Continuous-Time Linear Systems
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
[J]. ISIE: 2009 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, 2009, : 1137 - 1142
[22] On approximate policy iteration for continuous-time systems
Wernrud, Andreas
Rantzer, Anders
[J]. 2005 44th IEEE Conference on Decision and Control & European Control Conference, Vols 1-8, 2005, : 1453 - 1458
[23] Generalized Policy Iteration for Continuous-Time Systems
Vrabie, Draguna
Lewis, Frank L.
[J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 2677 - 2684
[24] Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems
Jiang, Yu
Jiang, Zhong-Ping
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (11) : 2917 - 2929
[25] Finite horizon optimal tracking control of partially unknown linear continuous-time systems using policy iteration
Li, Chao
Liu, Derong
Li, Hongliang
[J]. IET CONTROL THEORY AND APPLICATIONS, 2015, 9 (12): : 1791 - 1801
[26] Adaptive output-feedback optimal control for continuous-time linear systems based on adaptive dynamic programming approach
Shi, Zhan
Wang, Zhanshan
[J]. NEUROCOMPUTING, 2021, 438 : 334 - 344
[27] Linear-Like Policy Iteration Based Optimal Control for Continuous-Time Nonlinear Systems
Tahirovic, Adnan
Astolfi, Alessandro
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (10) : 5837 - 5849
[28] Adaptive dynamic programming based event-triggered control for unknown continuous-time nonlinear systems with input constraints
Xue, Shan
Luo, Biao
Liu, Derong
Li, Yueheng
[J]. NEUROCOMPUTING, 2020, 396 (396) : 191 - 200
[29] Event-triggered control based on adaptive dynamic programming for continuous-time nonlinear systems with completely unknown dynamics
Shi, Jing
Yue, Dong
Yang, Yang
Hu, Songlin
[J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2035 - 2040
[30] Optimal Control for Continuous-time Nonlinear Systems based on a Linear-like Policy Iteration
Tahirovic, Adnan
Astolfi, Alessandro
[J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 5238 - 5243

← 1 2 3 4 5 →