Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems

被引:0
|
作者
Dierks, Travis [1 ]
Thumati, Balaje T. [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
关键词
Nonlinear optimal control; heuristic dynamic programming; system identification; neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discrete time approximate dynamic programming (ADP) techniques have been widely used in the recent literature to determine the optimal or near optimal control policies for nonlinear systems. However, an inherent assumption of ADP requires at least partial knowledge of the system dynamics as well as the value of the controlled plant one step ahead. In this work, a novel approach to ADP is attempted while relaxing the need of the partial knowledge of the nonlinear system. The proposed methodology entails a two part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online to learn the complete plant dynamics and local asymptotic stability is shown under a mild assumption that the NN functional reconstruction errors lie within a small-gain type norm bounded conic sector. Then, using only the NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. Proof of convergence is demonstrated. Simulation results verify theoretical conjecture.
引用
收藏
页码:1368 / 1373
页数:6
相关论文
共 50 条
  • [1] Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence
    Zhang, Xin
    Zhang, Huaguang
    Sun, Qiuye
    Luo, Yanhong
    [J]. NEUROCOMPUTING, 2012, 91 : 48 - 55
  • [2] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    Wang, Ding
    Liu, Derong
    Wei, Qinglai
    Zhao, Dongbin
    Jin, Ning
    [J]. AUTOMATICA, 2012, 48 (08) : 1825 - 1832
  • [3] Dimension reduction based adaptive dynamic programming for optimal control of discrete-time nonlinear control-affine systems
    Li, Qiang
    Xu, Yunjun
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (11) : 2799 - 2811
  • [4] Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    [J]. NEUROCOMPUTING, 2015, 165 : 163 - 170
  • [5] Optimal Control for Unknown Discrete-Time Nonlinear Markov Jump Systems Using Adaptive Dynamic Programming
    Zhong, Xiangnan
    He, Haibo
    Zhang, Huaguang
    Wang, Zhanshan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) : 2141 - 2155
  • [6] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
    Jiahui Xu
    Jingcheng Wang
    Jun Rao
    Yanjiu Zhong
    Shangwei Zhao
    [J]. International Journal of Control, Automation and Systems, 2022, 20 : 3098 - 3109
  • [7] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Zhong, Yanjiu
    Zhao, Shangwei
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (09) : 3098 - 3109
  • [8] Policy Optimization Adaptive Dynamic Programming for Optimal Control of Input-Affine Discrete-Time Nonlinear Systems
    Lin, Mingduo
    Zhao, Bo
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4339 - 4350
  • [9] An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    Liu, Derong
    Wang, Ding
    Yang, Xiong
    [J]. INFORMATION SCIENCES, 2013, 220 : 331 - 342
  • [10] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    Lin, Hanquan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853