Finite-horizon near optimal adaptive control of uncertain linear discrete-time systems

被引:20
|
作者
Zhao, Qiming [1 ]
Xu, Hao [1 ]
Sarangapani, Jagannathan [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
来源
OPTIMAL CONTROL APPLICATIONS & METHODS | 2015年 / 36卷 / 06期
基金
美国国家科学基金会;
关键词
adaptive estimator; finite horizon; linear system; optimal control; Q-learning;
D O I
10.1002/oca.2143
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the finite-horizon near optimal adaptive regulation of linear discrete-time systems with unknown system dynamics is presented in a forward-in-time manner by using adaptive dynamic programming and Q-learning. An adaptive estimator (AE) is introduced to relax the requirement of system dynamics, and it is tuned by using Q-learning. The time-varying solution to the Bellman equation in adaptive dynamic programming is handled by utilizing a time-dependent basis function, while the terminal constraint is incorporated as part of the update law of the AE. The Kalman gain is obtained by using the AE parameters, while the control input is calculated by using AE and the system state vector. Next, to relax the need for state availability, an adaptive observer is proposed so that the linear quadratic regulator design uses the reconstructed states and outputs. For the time-invariant linear discrete-time systems, the closed-loop dynamics becomes non-autonomous and involved but verified by using standard Lyapunov and geometric sequence theory. Effectiveness of the proposed approach is verified by using simulation results. The proposed linear quadratic regulator design for the uncertain linear system requires an initial admissible control input and yields a forward-in-time and online solution without needing value and/or policy iterations. Copyright (C) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:853 / 872
页数:20
相关论文
共 50 条
  • [41] Optimal guaranteed cost control for uncertain discrete-time linear systems
    Yu, Li
    Kongzhi Lilun Yu Yinyong/Control Theory and Applications, 1999, 16 (05): : 639 - 642
  • [42] Optimal guaranteed cost control of discrete-time uncertain linear systems
    Petersen, IR
    Mcfarlane, DC
    Rotea, MA
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 1998, 8 (08) : 649 - 657
  • [43] Data-driven finite-horizon optimal tracking control scheme for completely unknown discrete-time nonlinear systems
    Song, Ruizhuo
    Xie, Yulong
    Zhang, Zenglian
    NEUROCOMPUTING, 2019, 356 : 206 - 216
  • [44] Inverse linear-quadratic discrete-time finite-horizon optimal control for indistinguishable homogeneous agents: A convex optimization approach
    Zhang, Han
    Ringh, Axel
    AUTOMATICA, 2023, 148
  • [45] Finite Horizon Optimal Tracking Control for Nonlinear Discrete-Time Switched Systems
    Qin, Chunbin
    Liu, Xianxing
    Liu, Guoquan
    Wang, Jun
    Zhang, Dehua
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 801 - 810
  • [46] Finite Horizon Optimal Tracking Control for a Class of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Wang, Ding
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 620 - 629
  • [47] Deep reinforcement learning based finite-horizon optimal control for a discrete-time affine nonlinear system
    Kim, Jong Woo
    Park, Byung Jun
    Yoo, Haeun
    Lee, Jay H.
    Lee, Jong Min
    2018 57TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2018, : 567 - 572
  • [48] INFINITE HORIZON LINEAR QUADRATIC OPTIMAL CONTROL FOR DISCRETE-TIME STOCHASTIC SYSTEMS
    Huang, Yulin
    Zhang, Weihai
    Zhang, Huanshui
    ASIAN JOURNAL OF CONTROL, 2008, 10 (05) : 608 - 615
  • [49] A nested computational approach to the discrete-time finite-horizon LQ control problem
    Marro, G
    Prattichizzo, D
    Zattoni, E
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2003, 42 (03) : 1002 - 1012
  • [50] LOGARITHMIC TRANSFORMATIONS FOR DISCRETE-TIME, FINITE-HORIZON STOCHASTIC-CONTROL PROBLEMS
    ALBERTINI, F
    RUNGGALDIER, WJ
    APPLIED MATHEMATICS AND OPTIMIZATION, 1988, 18 (02): : 143 - 161