Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics

被引:42
|
作者
Xiao, Geyang [1 ]
Zhang, Huaguang [1 ,2 ]
Luo, Yanhong [1 ]
Jiang, He [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, POB 134, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Natl Educ Minist, Key Lab Integrated Automat Proc Ind, Shenyang 110004, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 06期
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
ADAPTIVE OPTIMAL-CONTROL; CONTROL SCHEME; POLICY ITERATION; ALGORITHM;
D O I
10.1049/iet-cta.2015.0590
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, the optimal tracking control problem (OTCP) for affine non-linear continuous-time systems with completely unknown dynamics is addressed based on data by introducing the reinforcement learning (RL) technique. Unlike existing methods to the OTCP, the proposed data-driven policy iteration (PI) method does not need to have or identify any knowledge of the system dynamics, including both drift dynamics and input dynamics. To carry out the proposed method, the original OTCP is pre-processed to construct an augmented system composed of the error system dynamics and the desired trajectory dynamics. Then, based on the augmented system, a data-driven PI, which introduces discount factor to solve the OTCP, is implemented on an actor-critic neural network (NN) structure by only using system data rather than the exact knowledge of system dynamics. Two NNs are used in the structure to generate the optimal cost and optimal control policy, respectively, and the weights are updated by a least-square approach which minimises the residual errors. The proposed method is an off-policy RL method, where the data can be arbitrarily sampled on the state and input domain. Finally, simulation results are provided to show the effectiveness of the proposed method.
引用
收藏
页码:700 / 710
页数:11
相关论文
共 50 条
  • [21] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
    Peng, Zhinan
    Zhang, Jiefu
    Hu, Jiangping
    Huang, Rui
    Ghosh, Bijoy Kumar
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (10)
  • [22] Optimal containment control of continuous-time multi-agent systems with unknown disturbances using data-driven approach
    Zhinan Peng
    Jiefu Zhang
    Jiangping Hu
    Rui Huang
    Bijoy Kumar Ghosh
    [J]. Science China Information Sciences, 2020, 63
  • [23] Robust Data-Driven Control Barrier Functions for Unknown Continuous Control Affine Systems
    Jin, Zeyuan
    Khajenejad, Mohammad
    Yong, Sze Zheng
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 1309 - 1314
  • [24] Data-Driven Based Optimal Output-Feedback Control of Continuous-Time Systems
    Li, Zican
    Wu, Tao
    Na, Jing
    Zhao, Jun
    Gao, Guanbin
    Herrmann, Guido
    [J]. PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC), 2018,
  • [25] Data-Driven Optimal Control of Affine Systems: A Linear Programming Perspective
    Martinelli, Andrea
    Gargiani, Matilde
    Draskovic, Marina
    Lygeros, John
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 3092 - 3097
  • [26] Data-driven robust fault tolerant linear quadratic preview control of discrete-time linear systems with completely unknown dynamics
    Han, Kezhen
    Feng, Jian
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (01) : 49 - 59
  • [27] Data-driven optimal tracking control of switched linear systems
    Xu, Yichao
    Liu, Yang
    Ruan, Qihua
    Lou, Jungang
    [J]. NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 49
  • [28] Data-driven Iterative Learning Control for Continuous-Time Systems
    Chu, Bing
    Rapisarda, Paolo
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 4626 - 4631
  • [29] Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems
    Yang, Xiong
    He, Haibo
    Liu, Derong
    Zhu, Yuanheng
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (14): : 2307 - 2316
  • [30] Optimal Control of Affine Nonlinear Continuous-time Systems
    Dierks, T.
    Jagannathan, S.
    [J]. 2010 AMERICAN CONTROL CONFERENCE, 2010, : 1568 - 1573