Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems

被引:2
|
作者
Jiang, Huaiyuan [1 ]
Zhou, Bin [1 ]
Duan, Guang-Ren [1 ]
机构
[1] Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); data-driven control; discrete-time systems; modified 1-policy iteration (1-PI); policy iteration; unknown systems; STABILIZATION;
D O I
10.1109/TNNLS.2023.3244934
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
this article, the 1-policy iteration (1-PI) method for the optimal control problem of discrete-time linear systems is reconsidered and restated from a novel aspect. First, the traditional 1-PI method is recalled, and some new properties of the traditional 1-PI are proposed. Based on these new properties, a modified 1-PI algorithm is introduced with its convergence proven. Compared with the existing results, the initial con-dition is further relaxed. The data-driven implementation is then constructed with a new matrix rank condition for veri-fying the feasibility of the proposed data-driven implementation. A simulation example verifies the effectiveness of the proposed method.
引用
收藏
页码:3291 / 3301
页数:11
相关论文
共 50 条
  • [41] A Novel Iterative θ-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (04) : 1176 - 1190
  • [42] Generalized Policy Iteration-based Reinforcement Learning Algorithm for Optimal Control of Unknown Discrete-time Systems
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    Liu, Xi
    Luo, Fangchao
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3650 - 3655
  • [43] Optimal tracking control for linear discrete-time stochastic system based on adaptive dynamic programming
    Wang, Fang
    Chen, Xin
    Wang, Wei
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 1398 - 1403
  • [44] Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems With Trajectory-Based Initial Control Policy
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Wu, Shunyu
    Zhong, Yanjiu
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (03): : 1489 - 1501
  • [45] An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    Liu, Derong
    Wang, Ding
    Yang, Xiong
    [J]. INFORMATION SCIENCES, 2013, 220 : 331 - 342
  • [46] Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm
    Li, Xiaofeng
    Xue, Lei
    Sun, Changyin
    [J]. NEUROCOMPUTING, 2018, 314 : 86 - 93
  • [47] Adaptive event-triggered control and observer design for discrete-time nonlinear Markov jump systems with DoS attacks using policy iteration-based adaptive dynamic programming
    Lu, Hongqian
    Xing, Haobo
    Zhou, Wuneng
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2024, 45 (05): : 2153 - 2175
  • [48] Adaptive Modified Input and State Estimation for Linear Discrete-Time System with Unknown Inputs
    Ding, Bo
    Fang, Huajing
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (09) : 3630 - 3649
  • [49] Adaptive Modified Input and State Estimation for Linear Discrete-Time System with Unknown Inputs
    Bo Ding
    Huajing Fang
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 3630 - 3649
  • [50] Hamiltonian-driven Adaptive Dynamic Programming for Nonlinear Discrete-Time Dynamic Systems
    Yang, Yongliang
    Wunsch, Donald
    Yin, Yixin
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1339 - 1346