Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems

被引:0
|
作者
Dierks, Travis [1 ]
Thumati, Balaje T. [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
关键词
Nonlinear optimal control; heuristic dynamic programming; system identification; neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discrete time approximate dynamic programming (ADP) techniques have been widely used in the recent literature to determine the optimal or near optimal control policies for nonlinear systems. However, an inherent assumption of ADP requires at least partial knowledge of the system dynamics as well as the value of the controlled plant one step ahead. In this work, a novel approach to ADP is attempted while relaxing the need of the partial knowledge of the nonlinear system. The proposed methodology entails a two part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online to learn the complete plant dynamics and local asymptotic stability is shown under a mild assumption that the NN functional reconstruction errors lie within a small-gain type norm bounded conic sector. Then, using only the NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. Proof of convergence is demonstrated. Simulation results verify theoretical conjecture.
引用
收藏
页码:1368 / 1373
页数:6
相关论文
共 50 条
  • [21] Optimal Learning Control for Discrete-Time Nonlinear Systems Using Generalized Policy Iteration Based Adaptive Dynamic Programming
    Wei, Qinglai
    Liu, Derong
    [J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1781 - 1786
  • [22] Adaptive Optimal Control for Nonlinear Discrete-Time Systems
    Qin, Chunbin
    Zhang, Huaguang
    Luo, Yanhong
    [J]. PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 13 - 18
  • [23] Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control
    Zhu, Yuanheng
    Zhao, Dongbin
    He, Haibo
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 3959 - 3971
  • [24] Adaptive Dynamic Programming for Discrete-time LQR Optimal Tracking Control Problems with Unknown Dynamics
    Liu, Yang
    Luo, Yanhong
    Zhang, Huaguang
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 212 - 217
  • [25] MRAC for unknown discrete-time nonlinear systems based on supervised neural dynamic programming
    Fu, Hao
    Chen, Xin
    Wang, Wei
    Wu, Min
    [J]. NEUROCOMPUTING, 2020, 384 : 130 - 141
  • [26] A discrete-time multivariable neuro-adaptive control for nonlinear unknown dynamic systems
    Hwang, CL
    Lin, CH
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (06): : 865 - 877
  • [27] Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation
    Zhao, Bo
    Jia, Lihao
    Xia, Hongbing
    Li, Yuanchun
    [J]. NONLINEAR DYNAMICS, 2018, 93 (04) : 2089 - 2103
  • [28] Adaptive Dynamic Programming for a Class of Discrete-Time Non-Affine Nonlinear Systems with Time-Delays
    Liu, Derong
    Wei, Qinglai
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [29] Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation
    Bo Zhao
    Lihao Jia
    Hongbing Xia
    Yuanchun Li
    [J]. Nonlinear Dynamics, 2018, 93 : 2089 - 2103
  • [30] Adaptive Dynamic Programming for Finite-Horizon Optimal Control of Discrete-Time Nonlinear Systems with ε-Error Bound
    Wang, Fei-Yue
    Jin, Ning
    Liu, Derong
    Wei, Qinglai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (01): : 24 - 36