Dynamic compensator-based near-optimal control for unknown nonaffine systems via integral reinforcement learning

被引:4
|
作者
Lin, Jinquan [1 ]
Zhao, Bo [2 ]
Liu, Derong [3 ,4 ]
Wang, Yonghua [1 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] Beijing Normal Univ, Sch Syst Sci, Beijing 100875, Peoples R China
[3] Southern Univ Sci & Technol, Sch Syst Design & Intelligent Mfg, Shenzhen 518055, Peoples R China
[4] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Neuro-dynamic programming; Adaptive dynamic programming; Reinforcement learning; Optimal control; Neural networks; Dynamic compensator; CONTINUOUS-TIME; EXPERIENCE REPLAY; DESIGN; ALGORITHM;
D O I
10.1016/j.neucom.2023.126973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a dynamic compensator-based near-optimal control approach for unknown nonaffine nonlinear systems is developed by using integral reinforcement learning. Since system dynamics is unknown, it is difficult to obtain the optimal control policy via neuro-dynamic programming. To address this problem, a general dynamic compensator is introduced as the virtual control input to augment the unknown nonaffine nonlinear system as a partially unknown affine system. For the augmented system, a novel quadratic value function is designed with the system states, the actual control input and the virtual control input. The optimal control of the augmented system can be regarded as the near-optimal control for the original system since the novel optimal value function is an upper bound of the original optimal value function. In order to avoid the identification of system dynamics, the integral reinforcement learning framework is utilized to derive the optimal control based on the solution of Hamilton-Jacobi-Bellman equation via the critic-only structure. Meanwhile, the weight learning rule of the critic neural network is presented with the experience replay technique to relax the persistence of excitation condition. Moreover, the uniform ultimate boundedness of weight estimation errors and the stability of the closed-loop system are guaranteed by using the Lyapunov's direct method. Finally, simulation results of two examples demonstrate the effectiveness of the developed dynamic compensator-based near-optimal control method.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
    Qiu, Yu-Qing
    Li, Yan
    Wang, Zhong
    [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (04) : 1319 - 1330
  • [2] Robust Near-optimal Control for Constrained Nonlinear System via Integral Reinforcement Learning
    Yu-Qing Qiu
    Yan Li
    Zhong Wang
    [J]. International Journal of Control, Automation and Systems, 2023, 21 : 1319 - 1330
  • [3] Data-Based Approximate Optimal Control for Unknown Nonaffine Systems via Dynamic Feedback
    Lin, Jinquan
    Zhao, Bo
    Liu, Derong
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1301 - 1305
  • [4] Data-based approximate optimal control for unknown nonaffine systems via dynamic feedback
    Lin, Jinquan
    Zhao, Bo
    Liu, Derong
    [J]. Proceedings of 2023 IEEE 12th Data Driven Control and Learning Systems Conference, DDCLS 2023, 2023, : 1301 - 1305
  • [5] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
    Zhang, Junzhe
    Bareinboim, Elias
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Policy Gradient-based Integral Reinforcement Learning for Optimal Control Design of Nonaffine Morphing Aircraft Systems
    Lee, Hanna
    Kim, Seong-Hun
    Kim, Youdan
    [J]. 2020 28TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2020, : 218 - 223
  • [7] Integral Reinforcement-Learning-Based Optimal Containment Control for Partially Unknown Nonlinear Multiagent Systems
    Wu, Qiuye
    Wu, Yongheng
    Wang, Yonghua
    [J]. ENTROPY, 2023, 25 (02)
  • [8] Near-Optimal Offline Reinforcement Learning via Double Variance Reduction
    Yin, Ming
    Bai, Yu
    Wang, Yu-Xiang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Event-triggered-based online integral reinforcement learning for optimal control of unknown constrained nonlinear systems
    Han, Xiumei
    Zhao, Xudong
    Wang, Ding
    Wang, Bohui
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 213 - 225
  • [10] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    Wang, Ding
    Liu, Derong
    Wei, Qinglai
    Zhao, Dongbin
    Jin, Ning
    [J]. AUTOMATICA, 2012, 48 (08) : 1825 - 1832