Adaptive Learning in Tracking Control Based on the Dual Critic Network Design

被引:157
|
作者
Ni, Zhen [1 ]
He, Haibo [1 ]
Wen, Jinyu [2 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Huazhong Univ Sci & Technol, Coll Elect Elect & Engn, Wuhan 430074, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Adaptive critic design (ACD); adaptive dynamic programming (ADP); internal goal; lyapunov stability analysis; online learning; reinforcement learning; tracking control; virtual reality; TIME NONLINEAR-SYSTEMS; FEEDBACK CONTROL; STATE-FEEDBACK; CONTROL SCHEME; POWER-SYSTEM; NEUROCONTROL; GENERATORS;
D O I
10.1109/TNNLS.2013.2247627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new adaptive dynamic programming approach by integrating a reference network that provides an internal goal representation to help the systems learning and optimization. Specifically, we build the reference network on top of the critic network to form a dual critic network design that contains the detailed internal goal representation to help approximate the value function. This internal goal signal, working as the reinforcement signal for the critic network in our design, is adaptively generated by the reference network and can also be adjusted automatically. In this way, we provide an alternative choice rather than crafting the reinforcement signal manually from prior knowledge. In this paper, we adopt the online action-dependent heuristic dynamic programming (ADHDP) design and provide the detailed design of the dual critic network structure. Detailed Lyapunov stability analysis for our proposed approach is presented to support the proposed structure from a theoretical point of view. Furthermore, we also develop a virtual reality platform to demonstrate the real-time simulation of our approach under different disturbance situations. The overall adaptive learning performance has been tested on two tracking control benchmarks with a tracking filter. For comparative studies, we also present the tracking performance with the typical ADHDP, and the simulation results justify the improved performance with our approach.
引用
下载
收藏
页码:913 / 928
页数:16
相关论文
共 50 条
  • [41] Actor-Critic-Based Optimal Adaptive Control Design for Morphing Aircraft
    Lee, Hanna
    Kim, Seong-hun
    Kim, Youdan
    IFAC PAPERSONLINE, 2020, 53 (02): : 14863 - 14868
  • [42] Adaptive critic learning techniques for automotive engine control
    Javaherian, H
    Liu, D
    Zhang, Y
    Kovalenko, O
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 4066 - 4071
  • [43] Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor-Critic Reinforcement Learning
    Chen, Lin
    Dai, Shi-Lu
    Dong, Chao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7520 - 7533
  • [44] Fully probabilistic control design in an adaptive critic framework
    Herzallah, Randa
    Karny, Miroslav
    NEURAL NETWORKS, 2011, 24 (10) : 1128 - 1135
  • [45] Adaptive critic design-based robust neural network control for nonlinear distributed parameter systems with unknown dynamics
    Luo, Yanhong
    Sun, Qiuye
    Zhang, Huaguang
    Cui, Lili
    NEUROCOMPUTING, 2015, 148 : 200 - 208
  • [46] Robust Trajectory Tracking of Uncertain Systems via Adaptive Critic Learning
    Zhao, Ziliang
    Zhu, Qinglin
    Guo, Bin
    COMPLEXITY, 2022, 2022
  • [47] An adaptive critic based neurocontroller for process control
    Govindhasamy, JJ
    McLoone, SF
    Irwin, GW
    2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 849 - 858
  • [48] Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems With Experimental Validation
    Na, Jing
    Lv, Yongfeng
    Zhang, Kaiqiang
    Zhao, Jun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (01): : 459 - 472
  • [49] Adaptive Identifier-Critic-Based Optimal Tracking Control for Nonlinear Systems with Experimental Validation
    Na, Jing
    Lv, Yongfeng
    Zhang, Kaiqiang
    Zhao, Jun
    IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022, 52 (01) : 459 - 472
  • [50] Path-tracking Control of Quadrotor Using Adaptive Critic-based Neurofuzzy Controller
    Ramezani, Mohammad Sajad
    Ajami, Sahand
    Ghafarirad, Hamed
    2021 9TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2021, : 248 - 254