Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics

被引：42

作者：

Xiao, Geyang ^{[1
]}

Zhang, Huaguang ^{[1
,2
]}

Luo, Yanhong ^{[1
]}

Jiang, He ^{[1
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, POB 134, Shenyang 110819, Peoples R China

[2] Northeastern Univ, Natl Educ Minist, Key Lab Integrated Automat Proc Ind, Shenyang 110004, Peoples R China

来源：

IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 06期

基金：

中国国家自然科学基金; 国家高技术研究发展计划(863计划);

关键词：

ADAPTIVE OPTIMAL-CONTROL; CONTROL SCHEME; POLICY ITERATION; ALGORITHM;

D O I：

10.1049/iet-cta.2015.0590

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this study, the optimal tracking control problem (OTCP) for affine non-linear continuous-time systems with completely unknown dynamics is addressed based on data by introducing the reinforcement learning (RL) technique. Unlike existing methods to the OTCP, the proposed data-driven policy iteration (PI) method does not need to have or identify any knowledge of the system dynamics, including both drift dynamics and input dynamics. To carry out the proposed method, the original OTCP is pre-processed to construct an augmented system composed of the error system dynamics and the desired trajectory dynamics. Then, based on the augmented system, a data-driven PI, which introduces discount factor to solve the OTCP, is implemented on an actor-critic neural network (NN) structure by only using system data rather than the exact knowledge of system dynamics. Two NNs are used in the structure to generate the optimal cost and optimal control policy, respectively, and the weights are updated by a least-square approach which minimises the residual errors. The proposed method is an off-policy RL method, where the data can be arbitrarily sampled on the state and input domain. Finally, simulation results are provided to show the effectiveness of the proposed method.

引用

页码：700 / 710

页数：11

共 50 条

[21] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
Peng, Zhinan
Zhang, Jiefu
Hu, Jiangping
Huang, Rui
Ghosh, Bijoy Kumar
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (10)
[22] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
Zhinan Peng
Jiefu Zhang
Jiangping Hu
Rui Huang
Bijoy Kumar Ghosh
[J]. Science China Information Sciences, 2020, 63
[23] Robust Data-Driven Control Barrier Functions for Unknown Continuous Control Affine Systems
Jin, Zeyuan
Khajenejad, Mohammad
Yong, Sze Zheng
[J]. IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 1309 - 1314
[24] Data-Driven Based Optimal Output-Feedback Control of Continuous-Time Systems
Li, Zican
Wu, Tao
Na, Jing
Zhao, Jun
Gao, Guanbin
Herrmann, Guido
[J]. PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC), 2018,
[25] Data-Driven Optimal Control of Affine Systems: A Linear Programming Perspective
Martinelli, Andrea
Gargiani, Matilde
Draskovic, Marina
Lygeros, John
[J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 3092 - 3097
[26] Data-driven robust fault tolerant linear quadratic preview control of discrete-time linear systems with completely unknown dynamics
Han, Kezhen
Feng, Jian
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (01) : 49 - 59
[27] Data-driven optimal tracking control of switched linear systems
Xu, Yichao
Liu, Yang
Ruan, Qihua
Lou, Jungang
[J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 49
[28] Data-driven Iterative Learning Control for Continuous-Time Systems
Chu, Bing
Rapisarda, Paolo
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4626 - 4631
[29] Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems
Yang, Xiong
He, Haibo
Liu, Derong
Zhu, Yuanheng
[J]. IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (14): : 2307 - 2316
[30] Optimal Control of Affine Nonlinear Continuous-time Systems
Dierks, T.
Jagannathan, S.
[J]. 2010 AMERICAN CONTROL CONFERENCE, 2010, : 1568 - 1573

← 1 2 3 4 5 →