Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems

被引:2
|
作者
Jiang, Huaiyuan [1 ]
Zhou, Bin [1 ]
Duan, Guang-Ren [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); data-driven control; discrete-time systems; modified 1-policy iteration (1-PI); policy iteration; unknown systems; STABILIZATION;
D O I
10.1109/TNNLS.2023.3244934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
this article, the 1-policy iteration (1-PI) method for the optimal control problem of discrete-time linear systems is reconsidered and restated from a novel aspect. First, the traditional 1-PI method is recalled, and some new properties of the traditional 1-PI are proposed. Based on these new properties, a modified 1-PI algorithm is introduced with its convergence proven. Compared with the existing results, the initial con-dition is further relaxed. The data-driven implementation is then constructed with a new matrix rank condition for veri-fying the feasibility of the proposed data-driven implementation. A simulation example verifies the effectiveness of the proposed method.
引用
收藏
页码:3291 / 3301
页数:11
相关论文
共 50 条
  • [31] Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration
    Qasem, Omar
    Gao, Weinan
    Gutierrez, Hector
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1141 - 1146
  • [32] Policy gradient adaptive dynamic programming for nonlinear discrete-time zero-sum games with unknown dynamics
    Mingduo Lin
    Bo Zhao
    Derong Liu
    [J]. Soft Computing, 2023, 27 : 5781 - 5795
  • [33] Policy gradient adaptive dynamic programming for nonlinear discrete-time zero-sum games with unknown dynamics
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    [J]. SOFT COMPUTING, 2023, 27 (09) : 5781 - 5795
  • [34] Discrete-Time Impulsive Adaptive Dynamic Programming
    Wei, Qinglai
    Song, Ruizhuo
    Liao, Zehua
    Li, Benkai
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (10) : 4293 - 4306
  • [35] Neural-network-based stochastic linear quadratic optimal tracking control scheme for unknown discrete-time systems using adaptive dynamic programming
    Xin Chen
    Fang Wang
    [J]. Control Theory and Technology, 2021, 19 : 315 - 327
  • [36] Neural-network-based stochastic linear quadratic optimal tracking control scheme for unknown discrete-time systems using adaptive dynamic programming
    Chen, Xin
    Wang, Fang
    [J]. CONTROL THEORY AND TECHNOLOGY, 2021, 19 (03) : 315 - 327
  • [37] An Adaptive Dynamic Programming Algorithm Based on ITF-OELM for Discrete-Time Systems
    Zhang, Xiaofei
    Ma, Hongbin
    Chen, Junyong
    Li, Weixue
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3006 - 3011
  • [38] Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Han, Liyuan
    Zhang, Tielin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 1846 - 1856
  • [39] A novel adaptive dynamic programming based on tracking error for nonlinear discrete-time systems
    Li, Chun
    Ding, Jinliang
    Lewis, Frank L.
    Chai, Tianyou
    [J]. AUTOMATICA, 2021, 129
  • [40] A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm
    Lin, Qiao
    Wei, Qinglai
    Liu, Derong
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2017, 48 (03) : 525 - 534