Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm

Cited by: 20

Authors:

Li, Xiaofeng [1]

Xue, Lei [1]

Sun, Changyin [1]

Affiliation:

[1] Southeast University, School of Automation, Nanjing 210096, Jiangsu, People's Republic of China

Source:

NEUROCOMPUTING | 2018 / Volume 314

Funding:

National Natural Science Foundation of China;

Keywords:

Adaptive dynamic programming; Linear quadratic tracking; Reinforcement learning; Value iteration; ADAPTIVE OPTIMAL-CONTROL; ONLINE LEARNING CONTROL; FEEDBACK-CONTROL; REINFORCEMENT; DYNAMICS; DESIGN;

DOI:

10.1016/j.neucom.2018.05.111

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, an optimal tracking control scheme is proposed to solve the infinite-horizon linear quadratic tracking (LQT) problem using an iterative adaptive dynamic programming (ADP) algorithm. The reference trajectory is assumed to be produced by a linear command generator. First, via system transformation, an augmented system composed of the controlled system and the command generator is constructed. Then the Bellman equation is derived for the transformed system, with a discount factor in the cost function. To avoid requiring knowledge of the system dynamics, the iterative ADP algorithm is introduced to solve the Bellman equation, and its convergence is analyzed. A novel approach based on controllability and observability analysis is presented to show the stability of the tracking error. To facilitate implementation of this iterative approach, three neural networks (NNs) are employed as parametric structures to identify the unknown system dynamics, approximate the performance function, and search for the control policy, respectively. Finally, a simulation example is included to verify the effectiveness of the proposed scheme. © 2018 Published by Elsevier B.V.
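
The augmented-system and discounted-Bellman steps named in the abstract can be illustrated with a small sketch. This is not the paper's method: the paper avoids a system model by using three neural networks, whereas the code below assumes the matrices A, B, C and the command-generator matrix F are known, and applies the standard Riccati-type value-iteration recursion for a discounted quadratic cost. The function name, the numerical values, and the weights Q, R and discount factor gamma are hypothetical choices made only for illustration.

```python
import numpy as np


def lqt_value_iteration(A, B, C, F, Q, R, gamma, tol=1e-10, max_iter=10000):
    """Model-based value iteration for discounted LQT (illustrative sketch).

    Assumed setup (not taken from the paper's text):
      plant      x[k+1] = A x[k] + B u[k],  y[k] = C x[k]
      reference  r[k+1] = F r[k]
      cost       sum_i gamma^i * ((y - r)' Q (y - r) + u' R u)
    """
    n, m = B.shape
    p = F.shape[0]
    # Augmented state X = [x; r] with X[k+1] = T X[k] + B1 u[k].
    T = np.block([[A, np.zeros((n, p))], [np.zeros((p, n)), F]])
    B1 = np.vstack([B, np.zeros((p, m))])
    # Tracking error y - r = C1 X, so the stage cost is X' Q1 X + u' R u.
    C1 = np.hstack([C, -np.eye(p)])
    Q1 = C1.T @ Q @ C1

    def gain(P):
        # Greedy policy u = -K X for the value function V(X) = X' P X.
        return np.linalg.solve(R + gamma * B1.T @ P @ B1, gamma * B1.T @ P @ T)

    P = np.zeros((n + p, n + p))  # value iteration starts from V_0 = 0
    for _ in range(max_iter):
        K = gain(P)
        P_next = Q1 + gamma * T.T @ P @ T - gamma * T.T @ P @ B1 @ K
        P_next = 0.5 * (P_next + P_next.T)  # keep P numerically symmetric
        converged = np.max(np.abs(P_next - P)) < tol
        P = P_next
        if converged:
            break
    return P, gain(P)


# Hypothetical example: a stable two-state plant tracking a constant reference.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
F = np.array([[1.0]])
Q = np.array([[10.0]])
R = np.array([[1.0]])
P, K = lqt_value_iteration(A, B, C, F, Q, R, gamma=0.8)
```

The returned gain is applied to the augmented state as u[k] = -K [x[k]; r[k]]. Because the cost is discounted, this sketch says nothing about tracking-error stability, which the abstract reports as a separate analysis.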

Pages: 86-93

Number of pages: 8

Related records: 50 in total (first 10 listed)

[1] Optimal State Tracking Control for Linear Discrete-time Systems Via Value Iteration
Liu, Yingying
Shi, Zhan
Wang, Zhanshan
[J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019 : 836 - 841
[2] Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm
Hao, Longyan
Wang, Chaoli
Zhang, Guang
Jing, Chonglin
Shi, Yibo
[J]. INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, 53 (14) : 2845 - 2859
[3] Reinforcement Q-Learning Algorithm for H∞ Tracking Control of Unknown Discrete-Time Linear Systems
Peng, Yunjian
Chen, Qian
Sun, Weijie
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) : 4109 - 4122
[4] Tracking Control for Linear Discrete-Time Networked Control Systems With Unknown Dynamics and Dropout
Jiang, Yi
Fan, Jialu
Chai, Tianyou
Lewis, Frank L.
Li, Jinna
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4607 - 4620
[5] Quadratic Control of Linear Discrete-time Positive Systems
Krokavec, Dusan
Filasova, Anna
[J]. 2018 EUROPEAN CONTROL CONFERENCE (ECC), 2018 : 2879 - 2884
[6] Optimal control for discrete-time affine non-linear systems using general value iteration
Li, H.
Liu, D.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (18) : 2725 - 2736
[7] Cooperative Tracking Control of Unknown Discrete-Time Linear Multiagent Systems Subject to Unknown External Disturbances
Yang, Ruohan
Liu, Lu
Feng, Gang
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (10) : 6516 - 6528
[8] On discrete-time linear quadratic control
Czornik, A
[J]. SYSTEMS & CONTROL LETTERS, 1999, 36 (02) : 101 - 107
[9] Adaptive tracking for discrete-time systems with the unknown control directions
Ruan, Rong-Yao
Pan, Ren-Liang
Bi, Ping
Li, Yong-Zhi
Liu, Chun-Li
[J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 908 - 913
[10] Data-based stable value iteration optimal control for unknown discrete-time systems with time delays
Ren, He
Zhang, Huaguang
Su, Hanguang
Mu, Yunfei
[J]. NEUROCOMPUTING, 2020, 382 : 96 - 105
