Model-Free Optimal Tracking Control of Nonlinear Input-Affine Discrete-Time Systems via an Iterative Deterministic Q-Learning Algorithm

Cited by: 30

Authors:

Song, Shijie [1]

Zhu, Minglei [1]

Dai, Xiaolin [1]

Gong, Dawei [1]

Affiliation:

[1] Univ Elect Sci & Technol China, Sch Mech & Elect Engn, Chengdu 611731, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024, Vol. 35, No. 01

Funding:

Academy of Finland;

Keywords:

Heuristic algorithms; Q-learning; Nonlinear dynamical systems; Approximation algorithms; Iterative algorithms; Convergence; Artificial neural networks; Adaptive dynamic programming (ADP); neural network (NN); off-policy technique; optimal tracking control (OTC); CONTROL SCHEME; LINEAR-SYSTEMS;

DOI:

10.1109/TNNLS.2022.3178746

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this article, a novel model-free dynamic inversion-based Q-learning (DIQL) algorithm is proposed to solve the optimal tracking control (OTC) problem of unknown nonlinear input-affine discrete-time (DT) systems. Compared with the existing DIQL algorithm and the discount factor-based Q-learning (DFQL) algorithm, the proposed algorithm can eliminate the tracking error while ensuring that it is model-free and off-policy. First, a new deterministic Q-learning iterative scheme is presented, and based on this scheme, a model-based off-policy DIQL algorithm is designed. The advantage of this new scheme is that it can avoid the training of unusual data and improve data utilization, thereby saving computing resources. Simultaneously, the convergence and stability of the designed algorithm are analyzed, and the proof that adding probing noise into the behavior policy does not affect the convergence is presented. Then, by introducing neural networks (NNs), the model-free version of the designed algorithm is further proposed so that the OTC problem can be solved without any knowledge about the system dynamics. Finally, three simulation examples are given to demonstrate the effectiveness of the proposed algorithm.


Pages: 999-1012

Page count: 14

50 entries in total

[41] Costate-Supplement ADP for Model-Free Optimal Control of Discrete-Time Nonlinear Systems
Ye, Jun
Bian, Yougang
Luo, Biao
Hu, Manjiang
Xu, Biao
Ding, Rongjun
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 45 - 59
[42] Optimal Tracking Control of Affine Nonlinear Discrete-time Systems with Unknown Internal Dynamics
Dierks, Travis
Jagannathan, S.
PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 6750 - 6755
[43] An optimal terminal iterative learning control approach for nonlinear discrete-time systems
Chi, R.-H. (ronghu_chi@hotmail.com)
South China University of Technology, 2012, (29)
[44] A Discrete-Time Stochastic Iterative Learning Control Algorithm for a Class of Nonlinear Systems
Saab, S. S.
CONTROL AND INTELLIGENT SYSTEMS, 2005, 33 (02) : 95 - 101
[45] Iterative learning control for a class of nonlinear discrete-time systems with multiple input delays
Li, Xiao-Dong
Chow, Tommy W. S.
Ho, John K. L.
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2008, 39 (04) : 361 - 369
[46] Reinforcement Q-Learning and Non-Zero-Sum Games Optimal Tracking Control for Discrete-Time Linear Multi-Input Systems
Zhao, Jin-Gang
2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 277 - 282
[47] Model-free optimal control of discrete-time systems with additive and multiplicative noises
Lai, Jing
Xiong, Junlin
Shu, Zhan
AUTOMATICA, 2023, 147
[48] Finite-Time Model-Free Adaptive Control for Discrete-Time Nonlinear Systems
Weng, Yongpeng
Zhang, Qiuxia
Cao, Jinde
Yan, Huaicheng
Qi, Wenhai
Cheng, Jun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4113 - 4117
[49] Intelligent-Critic-Based Tracking Control of Discrete-Time Input-Affine Systems and Approximation Error Analysis With Application Verification
Wang, Ding
Gao, Ning
Ha, Mingming
Zhao, Mingming
Wu, Junlong
Qiao, Junfei
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (08) : 4690 - 4701
[50] Model-free distributed optimal control for general discrete-time linear systems using reinforcement learning
Feng, Xinjun
Zhao, Zhiyun
Yang, Wen
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (09) : 5570 - 5589
