Adaptive Learning in Tracking Control Based on the Dual Critic Network Design

被引:157
|
作者
Ni, Zhen [1 ]
He, Haibo [1 ]
Wen, Jinyu [2 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Huazhong Univ Sci & Technol, Coll Elect Elect & Engn, Wuhan 430074, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Adaptive critic design (ACD); adaptive dynamic programming (ADP); internal goal; lyapunov stability analysis; online learning; reinforcement learning; tracking control; virtual reality; TIME NONLINEAR-SYSTEMS; FEEDBACK CONTROL; STATE-FEEDBACK; CONTROL SCHEME; POWER-SYSTEM; NEUROCONTROL; GENERATORS;
D O I
10.1109/TNNLS.2013.2247627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new adaptive dynamic programming approach by integrating a reference network that provides an internal goal representation to help the systems learning and optimization. Specifically, we build the reference network on top of the critic network to form a dual critic network design that contains the detailed internal goal representation to help approximate the value function. This internal goal signal, working as the reinforcement signal for the critic network in our design, is adaptively generated by the reference network and can also be adjusted automatically. In this way, we provide an alternative choice rather than crafting the reinforcement signal manually from prior knowledge. In this paper, we adopt the online action-dependent heuristic dynamic programming (ADHDP) design and provide the detailed design of the dual critic network structure. Detailed Lyapunov stability analysis for our proposed approach is presented to support the proposed structure from a theoretical point of view. Furthermore, we also develop a virtual reality platform to demonstrate the real-time simulation of our approach under different disturbance situations. The overall adaptive learning performance has been tested on two tracking control benchmarks with a tracking filter. For comparative studies, we also present the tracking performance with the typical ADHDP, and the simulation results justify the improved performance with our approach.
引用
下载
收藏
页码:913 / 928
页数:16
相关论文
共 50 条
  • [31] Adaptive critic design with weight allocation for intelligent learning control of wastewater treatment plants
    Wang, Ding
    Ma, Hongyu
    Ren, Jin
    Gao, Ning
    Qiao, Junfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [32] Adaptive critic design in learning to play game of Go
    Zaman, R
    Prokhorov, D
    Wunsch, DC
    1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1 - 4
  • [33] Neural Network Adaptive Critic Control With Disturbance Rejection
    Wang, Ding
    Mu, Chaoxu
    Liu, Derong
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 202 - 207
  • [34] Neural network based tracking control for an elastic joint robot with input constraint via actor-critic design
    Ouyang, Yuncheng
    Dong, Lu
    Wei, Yanling
    Sun, Changyin
    NEUROCOMPUTING, 2020, 409 : 286 - 295
  • [35] Adaptive Critic Learning-Based Robust Control of Systems with Uncertain Dynamics
    Zhao, Jun
    Zeng, Qingliang
    Guo, Bin
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [36] Learning and Guaranteed Cost Control With Event-Based Adaptive Critic Implementation
    Wang, Ding
    Liu, Derong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 6004 - 6014
  • [37] Siamada: visual tracking based on Siamese adaptive learning network
    Xin Lu
    Fusheng Li
    Wanqi Yang
    Neural Computing and Applications, 2024, 36 : 7639 - 7656
  • [38] Siamada: visual tracking based on Siamese adaptive learning network
    Lu, Xin
    Li, Fusheng
    Yang, Wanqi
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (14): : 7639 - 7656
  • [39] The design of a neural network-based adaptive control method for robotic arm trajectory tracking
    Xu, Kun
    Wang, Zhiliang
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (12): : 8785 - 8795
  • [40] The design of a neural network-based adaptive control method for robotic arm trajectory tracking
    Kun Xu
    Zhiliang Wang
    Neural Computing and Applications, 2023, 35 : 8785 - 8795