Robust control scheme for a class of uncertain nonlinear systems with completely unknown dynamics using data-driven reinforcement learning method

被引:20
|
作者
Jiang, He [1 ]
Zhang, Huaguang [1 ]
Cui, Yang [1 ]
Xiao, Geyang [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Box 134, Shenyang 110819, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Adaptive dynamic programming; Data-driven; Model-free; Neural networks; OPTIMAL TRACKING CONTROL; ZERO-SUM GAMES; DIFFERENTIAL GRAPHICAL GAMES; POLICY UPDATE ALGORITHM; OPTIMAL-CONTROL DESIGN; MARKOV JUMP SYSTEMS; PROGRAMMING ALGORITHM; TIME-SYSTEMS; ITERATION; SYNCHRONIZATION;
D O I
10.1016/j.neucom.2017.07.058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with the robust control issues for a class of uncertain nonlinear systems with completely unknown dynamics via a data-driven reinforcement learning method. Firstly, we formulate the optimal regulation control problem for the nominal system, and then, the robust controller for the original uncertain system is designed by adding a constant feedback gain to the optimal controller of the nominal system. Then, this scheme is extended to the optimal tracking control by means of augmented system and discount factor. It is also demonstrated that the proposed robust controller can achieve optimality with a new defined performance index function when there is no control perturbation. It is well known that the nonlinear optimal control problem relies on the solution of Hamilton-Jacobi-Bellman (HJB) equation, which is a nonlinear partial differential equation and impossible to be solved analytically. In order to overcome this difficulty, we introduce a model-based iterative learning algorithm to successively approximate the solution of HJB equation and provide its convergence proof. Subsequently, based on the structure of the model-based approach, a data-driven reinforcement learning method is derived, which only requires the sampling data from real system with different control inputs rather than the accurate mathematical system models. Neural networks (NNs) are utilized to implement this model-free method to approximate the optimal solutions and the least-square approach is employed to minimize the NN approximation residual errors. Finally, two numerical simulation examples are given to illustrate the effectiveness of our proposed method. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:68 / 77
页数:10
相关论文
共 50 条
  • [41] Robust adaptive fuzzy control for a class of uncertain nonaffine nonlinear systems with unknown control directions
    Doudou, Sofiane
    Khaber, Farid
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2021, 43 (14) : 3103 - 3119
  • [42] Robust H∞ Control of Uncertain Stochastic Nonlinear Systems Driven by Noise of Unknown Covariance
    Wei Bo
    Ji Haibo
    [J]. PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 2, 2008, : 779 - 784
  • [43] Data-Driven Fault-Tolerant Reinforcement Learning Containment Control for Nonlinear Multiagent Systems
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 416 - 426
  • [44] Adaptive Robust Stabilization for A Class of Uncertain Nonlinear Systems with Unknown Virtual Control Coefficients
    Wang, Yuchao
    Wu, Hansheng
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2014, : 165 - 170
  • [45] Data-driven robust iterative learning control of linear systems
    Zhang, Zezhou
    Zou, Qingze
    [J]. AUTOMATICA, 2024, 164
  • [46] Data-driven Robust Optimal Control Design for Uncertain Cascaded Systems Using Value Iteration
    Bian, Tao
    Jiang, Zhong-Ping
    [J]. 2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 7610 - 7615
  • [47] Safe Reinforcement Learning using Data-Driven Predictive Control
    Selim, Mahmoud
    Alanwar, Amr
    El-Kharashi, M. Watheq
    Abbas, Hazem M.
    Johansson, Karl H.
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA), 2022,
  • [48] Data-Driven Optimal Tracking Control for Discrete-Time Nonlinear Systems With Unknown Dynamics Using Deterministic ADP
    Song, Shijie
    Gong, Dawei
    Zhu, Minglei
    Zhao, Yuyang
    Huang, Cong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [49] Reinforcement Learning-Based Control for a Class of Nonlinear Systems with unknown control directions
    Song, Xiaoling
    Huang, Miao
    Wen, Gang
    Ma, Longhua
    Yao, Jiaqing
    Lu, Zheming
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2519 - 2524
  • [50] Optimized Formation Control Using Simplified Reinforcement Learning for a Class of Multiagent Systems With Unknown Dynamics
    Wen, Guoxing
    Chen, C. L. Philip
    Li, Bin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (09) : 7879 - 7888