Novel optimal trajectory tracking for nonlinear affine systems with an advanced critic learning structure

被引：2

作者：

Wang, Ding ^{[1
]}

Zhao, Huiling

Zhao, Mingming

Ren, Jin

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

来源：

NEURAL NETWORKS | 2022年 / 154卷

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Discount factor; Dual heuristic dynamic programming; Neural networks; Optimal tracking control; Polynomial; Value iteration; DYNAMIC-PROGRAMMING ALGORITHM; VALUE-ITERATION; STABILITY ANALYSIS;

D O I：

10.1016/j.neunet.2022.07.019

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a critic learning structure based on the novel utility function is developed to solve the optimal tracking control problem with the discount factor of affine nonlinear systems. The utility function is defined as the quadratic form of the error at the next moment, which can not only avoid solving the stable control input, but also effectively eliminate the tracking error. Next, the theoretical derivation of the method under value iteration is given in detail with convergence and stability analysis. Then, the dual heuristic dynamic programming (DHP) algorithm via a single neural network is introduced to reduce the amount of computation. The polynomial is used to approximate the costate function during the DHP implementation. The weighted residual method is used to update the weight matrix. During simulation, the convergence speed of the given strategy is compared with the heuristic dynamic programming (HDP) algorithm. The experiment results display that the convergence speed of the proposed method is faster than the HDP algorithm. Besides, the proposed method is compared with the traditional tracking control approach to verify its tracking performance. The experiment results show that the proposed method can avoid solving the stable control input, and the tracking error is closer to zero than the traditional strategy. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页码：131 / 140

页数：10

共 50 条

[41] Robust trajectory tracking of flat nonlinear systems
Mahout, Vincent
Bernussou, Jacques
Khansah, Hael
PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, VOLS 1-4, 2006, : 191 - +
[42] Actor-Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems
Kiumarsi, Bahare
Lewis, Frank L.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (01) : 140 - 151
[43] Optimal Tracking Control for Uncertain Nonlinear Systems With Prescribed Performance via Critic-Only ADP
Dong, Hongyang
Zhao, Xiaowei
Luo, Biao
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (01): : 561 - 573
[44] Optimal trajectory tracking control for a class of nonlinear nonaffine systems via generalized N-step value gradient learning
Zhao, Mingming
Wang, Ding
Qiao, Junfei
Hu, Lingzhi
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (06) : 3471 - 3490
[45] Trajectory tracking for boom cranes based on nonlinear control and optimal trajectory generation
Arnold, Eckhard
Neupert, Joerg
Sawodny, Oliver
Schneider, Klaus
PROCEEDINGS OF THE 2007 IEEE CONFERENCE ON CONTROL APPLICATIONS, VOLS 1-3, 2007, : 985 - +
[46] General value iteration based reinforcement learning for solving optimal :tracking control problem of continuous-time affine nonlinear systems
Xiao, Geyang
Zhang, Huaguang
Luo, Yanhong
Qu, Qiuxia
NEUROCOMPUTING, 2017, 245 : 114 - 123
[47] Adaptive Critic-Based Tracking Control of Non-Affine Nonlinear Discrete-time Systems with Unknown Dynamics
Yang, Qinmin
Sun, Youxian
2011 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-6, 2011, : 2602 - 2607
[48] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
AUTOMATICA, 2013, 49 (01) : 82 - 92
[49] Reinforcement Learning-Based Predefined-Time Tracking Control for Nonlinear Systems Under Identifier-Critic-Actor Structure
Wang, Jing
Zhao, Wei
Cao, Jinde
Park, Ju H.
Shen, Hao
IEEE TRANSACTIONS ON CYBERNETICS, 2024, : 6345 - 6357
[50] On learning wavelet control for affine nonlinear systems
Xu, Jian-Xin
Yan, Rui
Wang, Wei
2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 2748 - +

← 1 2 3 4 5 →