Novel optimal trajectory tracking for nonlinear affine systems with an advanced critic learning structure

被引:2
|
作者
Wang, Ding [1 ]
Zhao, Huiling
Zhao, Mingming
Ren, Jin
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Discount factor; Dual heuristic dynamic programming; Neural networks; Optimal tracking control; Polynomial; Value iteration; DYNAMIC-PROGRAMMING ALGORITHM; VALUE-ITERATION; STABILITY ANALYSIS;
D O I
10.1016/j.neunet.2022.07.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a critic learning structure based on the novel utility function is developed to solve the optimal tracking control problem with the discount factor of affine nonlinear systems. The utility function is defined as the quadratic form of the error at the next moment, which can not only avoid solving the stable control input, but also effectively eliminate the tracking error. Next, the theoretical derivation of the method under value iteration is given in detail with convergence and stability analysis. Then, the dual heuristic dynamic programming (DHP) algorithm via a single neural network is introduced to reduce the amount of computation. The polynomial is used to approximate the costate function during the DHP implementation. The weighted residual method is used to update the weight matrix. During simulation, the convergence speed of the given strategy is compared with the heuristic dynamic programming (HDP) algorithm. The experiment results display that the convergence speed of the proposed method is faster than the HDP algorithm. Besides, the proposed method is compared with the traditional tracking control approach to verify its tracking performance. The experiment results show that the proposed method can avoid solving the stable control input, and the tracking error is closer to zero than the traditional strategy. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:131 / 140
页数:10
相关论文
共 50 条
  • [21] Adaptive Output Trajectory Tracking Control for a Class of Affine Nonlinear Discrete-Time Systems
    Wang, Zhuo
    Gao, Furong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 46 (03): : 326 - 333
  • [22] Discrete-time Inverse Optimal Control for Nonlinear Systems Trajectory Tracking
    Ornelas, Fernando
    Sanchez, Edgar N.
    Loukianov, Alexander G.
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4813 - 4818
  • [23] Restricted trajectory tracking in nonlinear systems
    García, RA
    Aguilar, JLM
    D'Attellis, CE
    LATIN AMERICAN APPLIED RESEARCH, 2001, 31 (01) : 7 - 15
  • [24] Online optimal tracking control of unknown nonlinear singularly perturbed systems using single network adaptive critic with improved learning
    Fu, Zhijun
    Ma, Bao
    Zhao, Dengfeng
    Yin, Yuming
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 8027 - 8041
  • [25] Optimal H. tracking control of nonlinear systems with zero-equilibrium-free via novel adaptive critic designs
    Peng, Zhinan
    Ji, Hanqi
    Zou, Chaobin
    Kuang, Yiqun
    Cheng, Hong
    Shi, Kaibo
    Ghosh, Bijoy Kumar
    NEURAL NETWORKS, 2023, 164 : 105 - 114
  • [26] Robust tracking control for nonlinear systems based on critic learning formulation with single network
    Huo Y.
    Wang D.
    Qiao J.-F.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (11): : 3066 - 3074
  • [27] Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems with Experimental Validation
    Na, Jing
    Lv, Yongfeng
    Zhang, Kaiqiang
    Zhao, Jun
    IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022, 52 (01) : 459 - 472
  • [28] Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems With Experimental Validation
    Na, Jing
    Lv, Yongfeng
    Zhang, Kaiqiang
    Zhao, Jun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (01): : 459 - 472
  • [29] Optimal Control of Affine Nonlinear Continuous-time Systems Using Online Actor-Critic Algorithm
    Chen Xue-song
    Yang Ming-sheng
    Liu Fu-chun
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2891 - 2894
  • [30] Novel learning framework for optimal multi-object video trajectory tracking
    Chen S.
    Hu X.
    Jiang W.
    Zhou W.
    Ding X.
    Virtual Reality and Intelligent Hardware, 2023, 5 (05): : 422 - 438