Novel optimal trajectory tracking for nonlinear affine systems with an advanced critic learning structure

Cited by: 2
Authors
Wang, Ding [1]
Zhao, Huiling
Zhao, Mingming
Ren, Jin
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Discount factor; Dual heuristic dynamic programming; Neural networks; Optimal tracking control; Polynomial; Value iteration; DYNAMIC-PROGRAMMING ALGORITHM; VALUE-ITERATION; STABILITY ANALYSIS;
DOI
10.1016/j.neunet.2022.07.019
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, a critic learning structure based on a novel utility function is developed to solve the discounted optimal tracking control problem for affine nonlinear systems. The utility function is defined as a quadratic form of the tracking error at the next time step, which both avoids the need to solve for the stable control input and effectively eliminates the tracking error. The theoretical derivation of the method under value iteration is then given in detail, together with convergence and stability analysis. Next, a dual heuristic dynamic programming (DHP) algorithm implemented with a single neural network is introduced to reduce the amount of computation. A polynomial is used to approximate the costate function in the DHP implementation, and the weighted residual method is used to update the weight matrix. In simulation, the convergence speed of the proposed strategy is compared with that of the heuristic dynamic programming (HDP) algorithm, and the results show that the proposed method converges faster. The proposed method is also compared with the traditional tracking control approach to verify its tracking performance. The results show that the proposed method avoids solving for the stable control input and yields a tracking error closer to zero than the traditional strategy. (C) 2022 Elsevier Ltd. All rights reserved.
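As a rough illustration of the utility described in the abstract (a quadratic form of the tracking error at the next time step, accumulated with a discount factor), the following minimal sketch evaluates the discounted cost of a fixed policy on a scalar affine system x_{k+1} = f(x_k) + g(x_k) u_k. The toy dynamics `f`, `g`, the reference generator `phi`, the weight `q`, the discount `gamma`, and both policies are assumptions chosen for illustration; they are not taken from the paper, and the sketch does not implement the paper's value iteration or DHP algorithm.

```python
import math


def step(x, r, u, f, g, phi):
    """Advance the affine system and the reference by one time step."""
    x_next = f(x) + g(x) * u   # affine in the control input u
    r_next = phi(r)            # reference trajectory dynamics (assumed form)
    return x_next, r_next


def utility(x_next, r_next, q):
    """Quadratic form of the tracking error at the NEXT time step."""
    e_next = x_next - r_next
    return q * e_next ** 2


def discounted_cost(x, r, policy, f, g, phi, q, gamma, steps):
    """Sum gamma**k * utility along the closed-loop trajectory."""
    cost = 0.0
    for k in range(steps):
        u = policy(x, r)
        x, r = step(x, r, u, f, g, phi)
        cost += gamma ** k * utility(x, r, q)
    return cost


# Hypothetical toy problem (all values are illustrative assumptions).
f = lambda x: 0.9 * x + 0.1 * math.sin(x)
g = lambda x: 1.0
phi = lambda r: 0.95 * r
q, gamma, steps = 1.0, 0.9, 20

# Policy 1: apply no control at all.
zero_policy = lambda x, r: 0.0
# Policy 2: pick u so that the next state lands on the next reference.
cancel_policy = lambda x, r: (phi(r) - f(x)) / g(x)

cost_zero = discounted_cost(1.0, 0.5, zero_policy, f, g, phi, q, gamma, steps)
cost_cancel = discounted_cost(1.0, 0.5, cancel_policy, f, g, phi, q, gamma, steps)
```

In this toy setting the utility depends on the control only through the next-step error, so a policy that drives that error to zero incurs essentially no cost, and no separate reference control input has to be computed beforehand. This is only one reading of how such a utility behaves on a simple example.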
Pages: 131-140
Number of pages: 10