Dynamic compensator-based near-optimal control for unknown nonaffine systems via integral reinforcement learning

Cited by: 7
Authors
Lin, Jinquan [1]
Zhao, Bo [2]
Liu, Derong [3, 4]
Wang, Yonghua [1]
Affiliations
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] Beijing Normal Univ, Sch Syst Sci, Beijing 100875, Peoples R China
[3] Southern Univ Sci & Technol, Sch Syst Design & Intelligent Mfg, Shenzhen 518055, Peoples R China
[4] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China;
Keywords
Neuro-dynamic programming; Adaptive dynamic programming; Reinforcement learning; Optimal control; Neural networks; Dynamic compensator; CONTINUOUS-TIME; EXPERIENCE REPLAY; DESIGN; ALGORITHM;
DOI
10.1016/j.neucom.2023.126973
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper develops a dynamic compensator-based near-optimal control approach for unknown nonaffine nonlinear systems using integral reinforcement learning. Because the system dynamics are unknown, it is difficult to obtain the optimal control policy via neuro-dynamic programming. To address this problem, a general dynamic compensator is introduced as the virtual control input, augmenting the unknown nonaffine nonlinear system into a partially unknown affine system. For the augmented system, a novel quadratic value function is designed in terms of the system states, the actual control input, and the virtual control input. The optimal control of the augmented system can be regarded as a near-optimal control for the original system, since the novel optimal value function is an upper bound of the original optimal value function. To avoid identifying the system dynamics, the integral reinforcement learning framework is used to derive the optimal control from the solution of the Hamilton-Jacobi-Bellman equation via a critic-only structure. Meanwhile, the weight learning rule of the critic neural network incorporates the experience replay technique to relax the persistence of excitation condition. Moreover, the uniform ultimate boundedness of the weight estimation errors and the stability of the closed-loop system are guaranteed using Lyapunov's direct method. Finally, simulation results for two examples demonstrate the effectiveness of the developed dynamic compensator-based near-optimal control method.
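The augmentation step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the scalar plant `plant`, the choice of a pure integrator (du/dt = v) as the compensator, the cost weights `q`, `r`, `m`, and all function names. The paper's general compensator and value function may take a different form.

```python
import math


def plant(x, u):
    # Hypothetical nonaffine plant: the input u enters nonlinearly.
    # In the setting of the abstract this function would be unknown.
    return -x + math.tanh(u) + 0.1 * u ** 3


def augmented_dynamics(z, v):
    # Augmented state z = (x, u). Assumed compensator: du/dt = v,
    # so the virtual input v enters the augmented system linearly.
    x, u = z
    return (plant(x, u), v)


def utility(z, v, q=1.0, r=1.0, m=1.0):
    # Quadratic running cost in the state, the actual input and the
    # virtual input (the weights q, r, m are illustrative assumptions).
    x, u = z
    return q * x ** 2 + r * u ** 2 + m * v ** 2


def irl_residual(value, z_start, z_end, integral_cost):
    # Integral Bellman residual over one interval:
    # V(z(t)) - V(z(t+T)) - integral of the utility on [t, t+T].
    # It uses measured data only and never calls the plant model.
    return value(z_start) - value(z_end) - integral_cost
```

In this sketch the first component of `augmented_dynamics` does not depend on `v` and the second equals `v`, which is what makes the augmented system affine in the virtual input even though `plant` is nonaffine in `u`.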
Pages: 9
Related papers
50 in total
  • [21] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    Wei, Qinglai
    NEURAL NETWORKS, 2014, 55 : 30 - 41
  • [22] Reinforcement learning-based near-optimal control for discrete time-delay singularly perturbed systems with unmeasurable states
    Xu, Meng
    Dai, Wei
    Zhang, Qirui
    Yang, Chunyu
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025,
  • [23] Adaptive Optimal Consensus Control of Multiagent Systems With Unknown Dynamics and Disturbances via Reinforcement Learning
    Chen L.
    Dong C.
    Dai S.-L.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2193 - 2203
  • [24] Near-optimal online control of dynamic discrete-event systems
    Grigorov, Lenko
    Rudie, Karen
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2006, 16 (04): : 419 - 449
  • [25] Near-Optimal Online Control of Dynamic Discrete-Event Systems
    Lenko Grigorov
    Karen Rudie
    Discrete Event Dynamic Systems, 2006, 16 : 419 - 449
  • [26] Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning
    Jing, Chonglin
    Wang, Chaoli
    Song, Hongkai
    Shi, Yibo
    Hao, Longyan
    MATHEMATICS, 2024, 12 (16)
  • [27] Fuzzy Reduced-Order Compensator-Based Stabilization for Interconnected Descriptor Systems via Integral Sliding Modes
    Li, Jinghao
    Zhang, Qingling
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (04): : 752 - 765
  • [28] Near-Optimal Control of Stochastic Recursive Systems Via Viscosity Solution
    Liangquan Zhang
    Qing Zhou
    Journal of Optimization Theory and Applications, 2018, 178 : 363 - 382
  • [29] Near-Optimal Control of Stochastic Recursive Systems Via Viscosity Solution
    Zhang, Liangquan
    Zhou, Qing
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2018, 178 (02) : 363 - 382
  • [30] Sliding mode learning compensator-based robust control of automotive steer-by-wire systems
    Kong, Huifang
    Zhang, Xiaoxue
    Wang, Hai
    Bao, Wei
    Jiang, Kaiwen
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2016, 26 (03) : 253 - 263