Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics

被引:42
|
作者
Xiao, Geyang [1 ]
Zhang, Huaguang [1 ,2 ]
Luo, Yanhong [1 ]
Jiang, He [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, POB 134, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Natl Educ Minist, Key Lab Integrated Automat Proc Ind, Shenyang 110004, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2016年 / 10卷 / 06期
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
ADAPTIVE OPTIMAL-CONTROL; CONTROL SCHEME; POLICY ITERATION; ALGORITHM;
D O I
10.1049/iet-cta.2015.0590
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, the optimal tracking control problem (OTCP) for affine non-linear continuous-time systems with completely unknown dynamics is addressed based on data by introducing the reinforcement learning (RL) technique. Unlike existing methods to the OTCP, the proposed data-driven policy iteration (PI) method does not need to have or identify any knowledge of the system dynamics, including both drift dynamics and input dynamics. To carry out the proposed method, the original OTCP is pre-processed to construct an augmented system composed of the error system dynamics and the desired trajectory dynamics. Then, based on the augmented system, a data-driven PI, which introduces discount factor to solve the OTCP, is implemented on an actor-critic neural network (NN) structure by only using system data rather than the exact knowledge of system dynamics. Two NNs are used in the structure to generate the optimal cost and optimal control policy, respectively, and the weights are updated by a least-square approach which minimises the residual errors. The proposed method is an off-policy RL method, where the data can be arbitrarily sampled on the state and input domain. Finally, simulation results are provided to show the effectiveness of the proposed method.
引用
收藏
页码:700 / 710
页数:11
相关论文
共 50 条
  • [1] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2012, 48 (10) : 2699 - 2704
  • [2] Data-Driven Adaptive Optimal Tracking Control for Completely Unknown Systems
    Hou, Dawei
    Na, Jing
    Gao, Guanbin
    Li, Guang
    [J]. PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS), 2018, : 1039 - 1044
  • [3] Data-Driven Adaptive Dynamic Programming for Optimal Control of Continuous-Time Multicontroller Systems With Unknown Dynamics
    Zhao, Jingang
    [J]. IEEE ACCESS, 2022, 10 : 41503 - 41511
  • [4] Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
    Derong Liu
    Xiong Yang
    Hongliang Li
    [J]. Neural Computing and Applications, 2013, 23 : 1843 - 1850
  • [5] Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
    Liu, Derong
    Yang, Xiong
    Li, Hongliang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2013, 23 (7-8): : 1843 - 1850
  • [6] Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics
    Lv, Yongfeng
    Na, Jing
    Yang, Qinmin
    Wu, Xing
    Guo, Yu
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2016, 89 (01) : 99 - 112
  • [7] Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems
    Mu, Chaoxu
    Ni, Zhen
    Sun, Changyin
    He, Haibo
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1460 - 1470
  • [8] Data-Driven Control for a Class of Non-Linear MIMO Systems
    Bakr, Mohamed
    Datar, Adwait
    Gonzalez, Antonio Mendez
    Werner, Herbert
    [J]. 2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 1174 - 1179
  • [9] Data-driven sliding mode tracking control for unknown Markovian jump non-linear systems
    Niu, Xiaoru
    Gao, Xianwen
    Weng, Yongpeng
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (16): : 2716 - 2723
  • [10] ON TRACKING DOMAINS OF CONTINUOUS-TIME NON-LINEAR CONTROL-SYSTEMS
    GRUJIC, LT
    [J]. RAIRO-AUTOMATIQUE-SYSTEMS ANALYSIS AND CONTROL, 1982, 16 (04): : 311 - 327