Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update

被引:204
|
作者
Dierks, Travis [1 ]
Jagannathan, Sarangapani [2 ]
机构
[1] DRS Sustainment Syst Inc, St Louis, MO 63121 USA
[2] Missouri Univ Sci & Technol, Dept Elect & Comp Engn, Rolla, MO 65409 USA
基金
美国国家科学基金会;
关键词
Hamilton-Jacobi-Bellman; online approximators; online nonlinear optimal control; time-based policy update; HORIZON OPTIMAL-CONTROL; NEURAL-NETWORKS; EXISTENCE; ITERATION;
D O I
10.1109/TNNLS.2012.2196708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, the Hamilton-Jacobi-Bellman equation is solved forward-in-time for the optimal control of a class of general affine nonlinear discrete-time systems without using value and policy iterations. The proposed approach, referred to as adaptive dynamic programming, uses two neural networks (NNs), to solve the infinite horizon optimal regulation control of affine nonlinear discrete-time systems in the presence of unknown internal dynamics and a known control coefficient matrix. One NN approximates the cost function and is referred to as the critic NN, while the second NN generates the control input and is referred to as the action NN. The cost function and policy are updated once at the sampling instant and thus the proposed approach can be referred to as time-based ADP. Novel update laws for tuning the unknown weights of the NNs online are derived. Lyapunov techniques are used to show that all signals are uniformly ultimately bounded and that the approximated control signal approaches the optimal control input with small bounded error over time. In the absence of disturbances, an optimal control is demonstrated. Simulation results are included to show the effectiveness of the approach. The end result is the systematic design of an optimal controller with guaranteed convergence that is suitable for hardware implementation.
引用
收藏
页码:1118 / 1129
页数:12
相关论文
共 50 条
  • [1] Optimal Tracking Control of Affine Nonlinear Discrete-time Systems with Unknown Internal Dynamics
    Dierks, Travis
    Jagannathan, S.
    [J]. PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 6750 - 6755
  • [2] Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    [J]. NEUROCOMPUTING, 2015, 165 : 163 - 170
  • [3] Optimal Control of Affine Nonlinear Discrete-time Systems
    Dierks, Travis
    Jagannthan, S.
    [J]. MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, : 1390 - 1395
  • [4] Adaptive Dynamic Programming-based Optimal Control of Unknown Affine Nonlinear Discrete-time Systems
    Dierks, Travis
    Thumati, Balaje T.
    Jagannathan, S.
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 1368 - 1373
  • [5] Learning Optimal Control Policy for Unknown Discrete-Time Systems
    Lai, Jing
    Xiong, Junlin
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4191 - 4195
  • [6] Online Adaptive Policy Learning Algorithm for H∞ State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems
    Zhang, Huaguang
    Qin, Chunbin
    Jiang, Bin
    Luo, Yanhong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2706 - 2718
  • [7] Online Adaptive Critic Robust Control of Discrete-Time Nonlinear Systems With Unknown Dynamics
    Fu, Hao
    Chen, Xin
    Wu, Min
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 1696 - 1701
  • [8] Sensitivity approach to optimal control for affine nonlinear discrete-time systems
    Tang, GY
    Xie, N
    Liu, P
    [J]. ASIAN JOURNAL OF CONTROL, 2005, 7 (04) : 448 - 454
  • [9] Constrained Optimal Control of Affine Nonlinear Discrete-Time Systems Using GHJB Method
    Cui, Lili
    Zhang, Huaguang
    Liu, Derong
    Kim, Yongsu
    [J]. ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 16 - +
  • [10] Discrete-time Control of Nonlinear Control-Affine Systems with Uncertain Dynamics
    Dongare, Abhijit
    Sanyal, Amit K.
    Hamrah, Reza
    [J]. 2023 NINTH INDIAN CONTROL CONFERENCE, ICC, 2023, : 389 - 394