Adaptive Dynamic Programming with Stable Value Iteration Algorithm for Discrete-Time Nonlinear Systems

被引:0
|
作者
Wei, Qinglai [1 ]
Liu, Derong [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new stable value iteration adaptive dynamic programming (ADP) algorithm, named "theta-ADP" algorithm, is proposed for solving the optimal control problems of infinite horizon discrete-time nonlinear systems. By introducing a parameter theta in the iterative ADP algorithm, it is proved that any of iterative control obtained in the proposed algorithm can stabilize the nonlinear system which overcomes the disadvantage of traditional value iteration algorithms. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative. theta-ADP algorithm. Finally, a simulation example is given to illustrate the performance of the proposed method.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems
    Liu, Derong
    Wei, Qinglai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (03) : 621 - 634
  • [2] Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Liu, Derong
    Lin, Hanquan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (03) : 840 - 853
  • [3] A novel stable value iteration-based approximate dynamic programming algorithm for discrete-time nonlinear systems
    Qu, Yan-Hua
    Wang, An-Na
    Lin, Sheng
    [J]. CHINESE PHYSICS B, 2018, 27 (01)
  • [4] A novel stable value iteration-based approximate dynamic programming algorithm for discrete-time nonlinear systems
    曲延华
    王安娜
    林盛
    [J]. Chinese Physics B, 2018, (01) : 232 - 239
  • [5] Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
    Liu, Derong
    Wei, Qinglai
    Yan, Pengfei
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (12): : 1577 - 1591
  • [6] Local Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Xu, Yancai
    Lin, Qiao
    Liu, Derong
    Song, Ruizhuo
    [J]. ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 148 - 153
  • [7] Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems
    Wei, Qinglai
    Liu, Derong
    [J]. NEURAL COMPUTING & APPLICATIONS, 2014, 24 (06): : 1355 - 1367
  • [8] Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems
    Qinglai Wei
    Derong Liu
    [J]. Neural Computing and Applications, 2014, 24 : 1355 - 1367
  • [9] Stable value iteration for two-player zero-sum game of discrete-time nonlinear systems based on adaptive dynamic programming
    Song, Ruizhuo
    Zhu, Liao
    [J]. NEUROCOMPUTING, 2019, 340 : 180 - 195
  • [10] A Generalized Policy Iteration Adaptive Dynamic Programming Algorithm for Optimal Control of Discrete-Time Nonlinear Systems with Actuator Saturation
    Lin, Qiao
    Wei, Qinglai
    Zhao, Bo
    [J]. ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 60 - 65