Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems

被引:0
|
作者
Dierks, Travis [1 ]
Thumati, Balaje T. [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
关键词
Nonlinear optimal control; heuristic dynamic programming; system identification; neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discrete time approximate dynamic programming (ADP) techniques have been widely used in the recent literature to determine the optimal or near optimal control policies for nonlinear systems. However, an inherent assumption of ADP requires at least partial knowledge of the system dynamics as well as the value of the controlled plant one step ahead. In this work, a novel approach to ADP is attempted while relaxing the need of the partial knowledge of the nonlinear system. The proposed methodology entails a two part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online to learn the complete plant dynamics and local asymptotic stability is shown under a mild assumption that the NN functional reconstruction errors lie within a small-gain type norm bounded conic sector. Then, using only the NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. Proof of convergence is demonstrated. Simulation results verify theoretical conjecture.
引用
收藏
页码:1368 / 1373
页数:6
相关论文
共 50 条
  • [41] Adaptive dynamic programming-based optimal control for nonlinear state constrained systems with input delay
    Jianfeng Wang
    Ping Zhang
    Yan Wang
    Zhicheng Ji
    [J]. Nonlinear Dynamics, 2023, 111 : 19133 - 19149
  • [42] Broad Learning System Approximation-Based Adaptive Optimal Control for Unknown Discrete-Time Nonlinear Systems
    Yuan, Liang'en
    Li, Tieshan
    Tong, Shaocheng
    Xiao, Yang
    Shan, Qihe
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5028 - 5038
  • [43] ε-Adaptive Dynamic Programming for Discrete-Time Systems
    Liu, Derong
    Jin, Ning
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1417 - 1424
  • [44] A novel adaptive dynamic programming based on tracking error for nonlinear discrete-time systems
    Li, Chun
    Ding, Jinliang
    Lewis, Frank L.
    Chai, Tianyou
    [J]. AUTOMATICA, 2021, 129
  • [45] Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Han, Liyuan
    Zhang, Tielin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 1846 - 1856
  • [46] Modified λ-Policy Iteration Based Adaptive Dynamic Programming for Unknown Discrete-Time Linear Systems
    Jiang, Huaiyuan
    Zhou, Bin
    Duan, Guang-Ren
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3291 - 3301
  • [47] Infinite-time stochastic linear quadratic optimal control for unknown discrete-time systems using adaptive dynamic programming approach
    Wang, Tao
    Zhang, Huaguang
    Luo, Yanhong
    [J]. NEUROCOMPUTING, 2016, 171 : 379 - 386
  • [48] Parallel Cross Entropy Policy Gradient Adaptive Dynamic Programming for Optimal Tracking Control of Discrete-Time Nonlinear Systems
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Zhong, Yanjiu
    Wu, Shunyu
    Sun, Qifang
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06): : 3809 - 3821
  • [49] Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming
    Wei Q.
    Liu D.
    [J]. Journal of Control Theory and Applications, 2011, 9 (3): : 381 - 390
  • [50] Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update
    Dierks, Travis
    Jagannathan, Sarangapani
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) : 1118 - 1129