Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm

被引:20
|
作者
Li, Xiaofeng [1 ]
Xue, Lei [1 ]
Sun, Changyin [1 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming; Linear quadratic tracking; Reinforcement learning; Value iteration; ADAPTIVE OPTIMAL-CONTROL; ONLINE LEARNING CONTROL; FEEDBACK-CONTROL; REINFORCEMENT; DYNAMICS; DESIGN;
D O I
10.1016/j.neucom.2018.05.111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, an optimal tracking control scheme is proposed to solve the infinite-horizon linear quadratic tracking (LQT) problem using iterative adaptive dynamic programming (ADP) algorithm. The reference trajectory is assumed to be produced by a linear command generator. First, via system transformation, an augmented system composed of controlled system and command generator is constructed. Then we derive the Bellman equation in terms of the transformed system with discount factor in cost function. In order to avoid requirement for knowledge of system dynamics, the iterative ADP algorithm is introduced to solve the Bellman equation with convergence analysis. A novel approach based on controllability and observability analysis is presented to show the stability of tracking error. For facilitating the implementation of this iterative approach, three neural networks (NNs) are employed as parametric structures to identify the unknown system dynamics, approximate performance function and search control policy, respectively. Finally, a simulation example is included to verify the effectiveness of the proposed scheme. (C) 2018 Published by Elsevier B.V.
引用
收藏
页码:86 / 93
页数:8
相关论文
共 50 条
  • [1] Optimal State Tracking Control for Linear Discrete-time Systems Via Value Iteration
    Liu, Yingying
    Shi, Zhan
    Wang, Zhanshan
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 836 - 841
  • [2] Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm
    Hao, Longyan
    Wang, Chaoli
    Zhang, Guang
    Jing, Chonglin
    Shi, Yibo
    [J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, 53 (14) : 2845 - 2859
  • [3] Reinforcement Q-Learning Algorithm for H∞ Tracking Control of Unknown Discrete-Time Linear Systems
    Peng, Yunjian
    Chen, Qian
    Sun, Weijie
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4109 - 4122
  • [4] Tracking Control for Linear Discrete-Time Networked Control Systems With Unknown Dynamics and Dropout
    Jiang, Yi
    Fan, Jialu
    Chai, Tianyou
    Lewis, Frank L.
    Li, Jinna
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4607 - 4620
  • [5] Quadratic Control of Linear Discrete-time Positive Systems
    Krokavec, Dusan
    Filasova, Anna
    [J]. 2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018, : 2879 - 2884
  • [6] Optimal control for discrete-time affine non-linear systems using general value iteration
    Li, H.
    Liu, D.
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (18): : 2725 - 2736
  • [7] Cooperative Tracking Control of Unknown Discrete-Time Linear Multiagent Systems Subject to Unknown External Disturbances
    Yang, Ruohan
    Liu, Lu
    Feng, Gang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (10) : 6516 - 6528
  • [8] On discrete-time linear quadratic control
    Czornik, A
    [J]. SYSTEMS & CONTROL LETTERS, 1999, 36 (02) : 101 - 107
  • [9] Adaptive tracking for discrete-time systems with the unknown control directions
    Ruan, Rong-Yao
    Pan, Ren-Liang
    Bi, Ping
    Li, Yong-Zhi
    Liu, Chun-Li
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 908 - 913
  • [10] Data-based stable value iteration optimal control for unknown discrete-time systems with time delays
    Ren, He
    Zhang, Huaguang
    Su, Hanguang
    Mu, Yunfei
    [J]. NEUROCOMPUTING, 2020, 382 : 96 - 105