Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation

Cited by: 14
Authors
Ha, Mingming [1 ,2 ]
Wang, Ding [3 ]
Liu, Derong [4 ,5 ]
Affiliations
[1] Ant Grp, MYbank, Beijing 100020, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automation & Elect Engn, Beijing 100083, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[4] Southern Univ Sci & Technol, Sch Syst Design & Intelligent Mfg, Shenzhen 518055, Peoples R China
[5] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Iterative methods; Convergence; Power system stability; Optimal control; Stability criteria; Cost function; Closed loop systems; Adaptive critic designs; adaptive dynamic programming (ADP); discrete-time nonlinear systems; fast convergence rate; reinforcement learning; value iteration (VI); STABILITY ANALYSIS; VALUE-ITERATION; SUBJECT;
DOI
10.1109/TCYB.2022.3233593
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Inspired by the successive relaxation method, a novel discounted iterative adaptive dynamic programming framework is developed, in which the iterative value function sequence possesses an adjustable convergence rate. The different convergence properties of the value function sequence and the stability of the closed-loop systems under the new discounted value iteration (VI) are investigated. Based on the properties of the given VI scheme, an accelerated learning algorithm with convergence guarantee is presented. Moreover, the implementations of the new VI scheme and its accelerated learning design are elaborated, which involve value function approximation and policy improvement. A nonlinear fourth-order ball-and-beam balancing plant is used to verify the performance of the developed approaches. Compared with the traditional VI, the present discounted iterative adaptive critic designs greatly accelerate the convergence rate of the value function and reduce the computational cost simultaneously.
Pages: 3003-3016
Number of pages: 14
Related papers
50 in total
  • [41] Adaptive critic learning with fuzzy utility
    Matzner, SA
    Shannon, TT
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 888 - 892
  • [42] Robust adaptive critic control design with network-based event-triggered formulation
    Chaoxu Mu
    Ding Wang
    Changyin Sun
    Qun Zong
    Nonlinear Dynamics, 2017, 90 : 2023 - 2035
  • [43] DISCOUNTED ESTIMATION AND DISCRETIZATION IN ADAPTIVE-CONTROL
    MANDL, P
    LAUSMANOVA, M
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1992, 184 : 321 - 329
  • [44] Robust adaptive critic control design with network-based event-triggered formulation
    Mu, Chaoxu
    Wang, Ding
    Sun, Changyin
    Zong, Qun
    NONLINEAR DYNAMICS, 2017, 90 (03) : 2023 - 2035
  • [45] Robust tracking control for nonlinear systems based on critic learning formulation with single network
    Huo Y.
    Wang D.
    Qiao J.-F.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (11): : 3066 - 3074
  • [46] Adaptive critic designs for host-based intrusion detection
    Draelos, T
    Duggan, D
    Collins, M
    Wunsch, D
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1720 - 1725
  • [47] Adaptive critic designs (vol 8, pg 997, 1997)
    Prokhorov, DV
    Wunsch, DC
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (06): : 1563 - 1563
  • [48] Adaptive critic designs based coupled neurocontrollers for a static compensator
    Mohagheghi, Sahnan
    Venayagamoorthy, Ganesh K.
    Harley, Ronald G.
    PROCEEDINGS OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL, 2006, : 99 - +
  • [49] A new fuzzy identification method based on adaptive critic designs
    Zhang, Huaguang
    Luo, Yanhong
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 804 - 809
  • [50] Adaptive critic designs and their implementations on different neural network architectures
    Park, JW
    Venayagamoorthy, GK
    Harley, RG
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1879 - 1884