Adaptive Learning in Tracking Control Based on the Dual Critic Network Design

被引:157
|
作者
Ni, Zhen [1 ]
He, Haibo [1 ]
Wen, Jinyu [2 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Huazhong Univ Sci & Technol, Coll Elect Elect & Engn, Wuhan 430074, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Adaptive critic design (ACD); adaptive dynamic programming (ADP); internal goal; lyapunov stability analysis; online learning; reinforcement learning; tracking control; virtual reality; TIME NONLINEAR-SYSTEMS; FEEDBACK CONTROL; STATE-FEEDBACK; CONTROL SCHEME; POWER-SYSTEM; NEUROCONTROL; GENERATORS;
D O I
10.1109/TNNLS.2013.2247627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new adaptive dynamic programming approach by integrating a reference network that provides an internal goal representation to help the systems learning and optimization. Specifically, we build the reference network on top of the critic network to form a dual critic network design that contains the detailed internal goal representation to help approximate the value function. This internal goal signal, working as the reinforcement signal for the critic network in our design, is adaptively generated by the reference network and can also be adjusted automatically. In this way, we provide an alternative choice rather than crafting the reinforcement signal manually from prior knowledge. In this paper, we adopt the online action-dependent heuristic dynamic programming (ADHDP) design and provide the detailed design of the dual critic network structure. Detailed Lyapunov stability analysis for our proposed approach is presented to support the proposed structure from a theoretical point of view. Furthermore, we also develop a virtual reality platform to demonstrate the real-time simulation of our approach under different disturbance situations. The overall adaptive learning performance has been tested on two tracking control benchmarks with a tracking filter. For comparative studies, we also present the tracking performance with the typical ADHDP, and the simulation results justify the improved performance with our approach.
引用
下载
收藏
页码:913 / 928
页数:16
相关论文
共 50 条
  • [1] Adaptive Critic Tracking Design for Data-Based Nonaffine Predictive Control
    Wang, Ding
    Xin, Peng
    Ren, Jin
    Qiao, Junfei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 21 (04) : 1 - 12
  • [2] Robust tracking control for nonlinear systems based on critic learning formulation with single network
    Huo Y.
    Wang D.
    Qiao J.-F.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (11): : 3066 - 3074
  • [3] Neural network-based adaptive critic designs for self-learning control
    Liu, DR
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 1252 - 1256
  • [4] Sequential learning for adaptive critic design: An industrial control application
    Govindhasamy, JJ
    McLoone, SF
    Irwin, GW
    2005 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2005, : 265 - 270
  • [5] Asymptotic tracking by a reinforcement learning-based adaptive critic controller
    Shubhendu BHASIN
    Nitin SHARMA
    Parag PATRE
    Warren DIXON
    Control Theory and Technology, 2011, 9 (03) : 400 - 409
  • [6] Asymptotic tracking by a reinforcement learning-based adaptive critic controller
    Bhasin S.
    Sharma N.
    Patre P.
    Dixon W.
    Journal of Control Theory and Applications, 2011, 9 (3): : 400 - 409
  • [7] Fuzzy Optimal Tracking Control of Hypersonic Flight Vehicles via Single-Network Adaptive Critic Design
    Bu, Xiangwei
    Qi, Qiang
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (01) : 270 - 278
  • [8] Robust adaptive critic control design with network-based event-triggered formulation
    Chaoxu Mu
    Ding Wang
    Changyin Sun
    Qun Zong
    Nonlinear Dynamics, 2017, 90 : 2023 - 2035
  • [9] Robust adaptive critic control design with network-based event-triggered formulation
    Mu, Chaoxu
    Wang, Ding
    Sun, Changyin
    Zong, Qun
    NONLINEAR DYNAMICS, 2017, 90 (03) : 2023 - 2035
  • [10] Design and Implementation of an Adaptive Cruise Control System Based on Supervised Actor-Critic Learning
    Wang, Bin
    Zhao, Dongbin
    Li, Chengdong
    Dai, Yujie
    2015 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2015, : 243 - 248