Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems

被引：0

作者：

Dierks, Travis ^{[1
]}

Thumati, Balaje T. ^{[1
]}

Jagannathan, S. ^{[1
]}

机构：

[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA

来源：

IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6 | 2009年

关键词：

Nonlinear optimal control; heuristic dynamic programming; system identification; neural network;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Discrete time approximate dynamic programming (ADP) techniques have been widely used in the recent literature to determine the optimal or near optimal control policies for nonlinear systems. However, an inherent assumption of ADP requires at least partial knowledge of the system dynamics as well as the value of the controlled plant one step ahead. In this work, a novel approach to ADP is attempted while relaxing the need of the partial knowledge of the nonlinear system. The proposed methodology entails a two part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online to learn the complete plant dynamics and local asymptotic stability is shown under a mild assumption that the NN functional reconstruction errors lie within a small-gain type norm bounded conic sector. Then, using only the NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. Proof of convergence is demonstrated. Simulation results verify theoretical conjecture.

引用

页码：1368 / 1373

页数：6

共 50 条

[1] Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence
Zhang, Xin
Zhang, Huaguang
Sun, Qiuye
Luo, Yanhong
[J]. NEUROCOMPUTING, 2012, 91 : 48 - 55
[2] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
Wang, Ding
Liu, Derong
Wei, Qinglai
Zhao, Dongbin
Jin, Ning
[J]. AUTOMATICA, 2012, 48 (08) : 1825 - 1832
[3] Dimension reduction based adaptive dynamic programming for optimal control of discrete-time nonlinear control-affine systems
Li, Qiang
Xu, Yunjun
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (11) : 2799 - 2811
[4] Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming
Xiao, Geyang
Zhang, Huaguang
Luo, Yanhong
[J]. NEUROCOMPUTING, 2015, 165 : 163 - 170
[5] Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
Zhong, Xiangnan
He, Haibo
Zhang, Huaguang
Wang, Zhanshan
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2141 - 2155
[6] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
Jiahui Xu
Jingcheng Wang
Jun Rao
Yanjiu Zhong
Shangwei Zhao
[J]. International Journal of Control, Automation and Systems, 2022, 20 : 3098 - 3109
[7] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
Xu, Jiahui
Wang, Jingcheng
Rao, Jun
Zhong, Yanjiu
Zhao, Shangwei
[J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (09) : 3098 - 3109
[8] Policy Optimization Adaptive Dynamic Programming for Optimal Control of Input-Affine Discrete-Time Nonlinear Systems
Lin, Mingduo
Zhao, Bo
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4339 - 4350
[9] An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
Liu, Derong
Wang, Ding
Yang, Xiong
[J]. INFORMATION SCIENCES, 2013, 220 : 331 - 342
[10] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
Wei, Qinglai
Liu, Derong
Lin, Hanquan
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853

← 1 2 3 4 5 →