Robust control under worst-case uncertainty for unknown nonlinear systems using modified reinforcement learning

被引:32
|
作者
Perrusquia, Adolfo [1 ]
Yu, Wen [1 ]
机构
[1] CINVESTAV IPN, Natl Polytech Inst, Dept Automat Control, Ave IPN 2508, Mexico City 07360, DF, Mexico
关键词
k-nearest neighbors; double estimator; overestimation; robust reward; state-action space; worst-case uncertainty; POLICY; STABILIZATION; DESIGN;
D O I
10.1002/rnc.4911
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is an effectivemethod for the design of robust controllers of unknown nonlinear systems. Normal RLs for robust control, such as actor-critic (AC) algorithms, depend on the estimation accuracy. Uncertainty in the worst case requires a large state-action space, this causes overestimation and computational problems. In this article, the RL method is modified with the k-nearest neighbor and the double Q-learning algorithm. The modified RL does not need the neural estimator as AC and can stabilize the unknown nonlinear system under the worst-case uncertainty. The convergence property of the proposed RL method is analyzed. The simulations and the experimental results show that our modified RLs are much more robust compared with the classic controllers, such as the proportional-integral-derivative, the sliding mode, and the optimal linear quadratic regulator controllers.
引用
收藏
页码:2920 / 2936
页数:17
相关论文
共 50 条
  • [41] Robust adaptive beamforming using worst-case performance optimization
    Gershman, AB
    Luo, ZQ
    Shahbazpanahi, S
    Vorobyov, SA
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1353 - 1357
  • [42] Optimal, robust predictive control of nonlinear systems under probabilistic uncertainty using particles
    Blackmore, Lars
    Williams, Brian C.
    [J]. 2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 3366 - 3368
  • [43] Probabilistically Robust Learning: Balancing Average- and Worst-case Performance
    Robey, Alexander
    Chamon, Luiz F. O.
    Pappas, George J.
    Hassani, Hamed
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [44] Robust control scheme for a class of uncertain nonlinear systems with completely unknown dynamics using data-driven reinforcement learning method
    Jiang, He
    Zhang, Huaguang
    Cui, Yang
    Xiao, Geyang
    [J]. NEUROCOMPUTING, 2018, 273 : 68 - 77
  • [45] Reinforcement Learning-Based Control for a Class of Nonlinear Systems with unknown control directions
    Song, Xiaoling
    Huang, Miao
    Wen, Gang
    Ma, Longhua
    Yao, Jiaqing
    Lu, Zheming
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 2519 - 2524
  • [46] Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    Wei, Qinglai
    [J]. NEURAL NETWORKS, 2014, 55 : 30 - 41
  • [47] Worst-Case Spoofing Attack and Robust Countermeasure in Satellite Navigation Systems
    Crosara, Laura
    Ardizzon, Francesco
    Tomasin, Stefano
    Laurenti, Nicola
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 2039 - 2050
  • [48] THE ROBUST H-2 CONTROL PROBLEM - A WORST-CASE DESIGN
    STOORVOGEL, AA
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1993, 38 (09) : 1358 - 1370
  • [49] Robust Worst-Case Interference Control in Underlay Cognitive Radio Networks
    Parsaeefard, Saeedeh
    Sharafat, Ahmad R.
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2012, 61 (08) : 3731 - 3745
  • [50] A Nonlinear Tolerance Analysis Method Using Worst-Case and Matlab
    Yu, Meiqiong
    Yan, Yan
    Hao, Jia
    Wang, Guoxin
    [J]. ADVANCED MANUFACTURING SYSTEMS, PTS 1-3, 2011, 201-203 : 247 - 252