Robust Control of Unknown Observable Nonlinear Systems Solved as a Zero-Sum Game

被引:27
|
作者
Radac, Mircea-Bogdan [1 ]
Lala, Timotei [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300223, Romania
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Mathematical model; Robust control; Games; Optimal control; Linear systems; Game theory; Roads; Active suspension system; approximate dynamic programming; neural networks; optimal control; reinforcement learning; state feedback; zero-sum two-player games; STATE-FEEDBACK CONTROL; DISCRETE-TIME-SYSTEMS; H-INFINITY CONTROL; VEHICLE SUSPENSION; LEARNING ALGORITHM; DESIGN;
D O I
10.1109/ACCESS.2020.3040185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimal robust control solution for general nonlinear systems with unknown but observable dynamics is advanced here. The underlying Hamilton-Jacobi-Isaacs (HJI) equation of the corresponding zero-sum two-player game (ZS-TP-G) is learned using a Q-learning-based approach employing only input-output system measurements, assuming system observability. An equivalent virtual state-space model is built from the system's input-output samples and it is shown that controlling the former implies controlling the latter. Since the existence of a saddle-point solution to the ZS-TP-G is assumed unverifiable, the solution is derived in terms of upper-optimal and lower-optimal controllers. The learning convergence is theoretically ensured while practical implementation is performed using neural networks that provide scalability to the control problem dimension and automatic feature selection. The learning strategy is checked on an active suspension system, a good candidate for the robust control problem with respect to road profile disturbance rejection.
引用
收藏
页码:214153 / 214165
页数:13
相关论文
共 50 条
  • [41] Clean Energy Is Not a Zero-Sum Game
    Sprovieri, John
    Assembly, 2022, 35 (11):
  • [42] Adaptive robust control without initial stabilizing for constrained-states nonlinear multiplayer mixed zero-sum game systems with matched input disturbances
    Qiao, Xiaopeng
    Qin, Chunbin
    Wang, Jinguang
    Zhang, Zhongwei
    Shang, Ziyang
    Applied Intelligence, 2025, 55 (02)
  • [43] Learning nonlinear robust control as a data-driven zero-sum two-player game for an active suspension system
    Radac, Mircea-Bogdan
    Lala, Timotei
    IFAC PAPERSONLINE, 2020, 53 (02): : 8057 - 8062
  • [44] Cooperation research on zero-sum game
    Li Rui
    Xie Nenggang
    Meng Rui
    Xu Gang
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3338 - 3344
  • [45] MEMORY RESEARCH IS NOT A ZERO-SUM GAME
    TULVING, E
    AMERICAN PSYCHOLOGIST, 1991, 46 (01) : 41 - 42
  • [46] Aging Theories and the Zero-Sum Game
    Goldsmith, Theodore C.
    REJUVENATION RESEARCH, 2014, 17 (01) : 1 - 2
  • [47] Event-Triggered Safe Control for the Zero-Sum Game of Nonlinear Safety-Critical Systems With Input Saturation
    Qin, Chunbin
    Zhu, Heyang
    Wang, Jinguang
    Xiao, Qiyang
    Zhang, Dehua
    IEEE ACCESS, 2022, 10 : 40324 - 40337
  • [48] A Robust Zero-Sum Game Framework for Pool-based Active Learning
    Zhu, Dixian
    Li, Zhe
    Wang, Xiaoyu
    Gong, Boqing
    Yang, Tianbao
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 517 - 526
  • [49] The loneliness of the zero-sum game loser. The balance of social exchange and belief in a zero-sum game as predictors of loneliness
    Borawski, Dominik
    PERSONALITY AND INDIVIDUAL DIFFERENCES, 2018, 135 : 270 - 276
  • [50] Online Iterative Adaptive Dynamic Programming Approach for Solving the Zero-Sum Game for Nonlinear Continuous-Time Systems with Partially Unknown Dynamics
    Fu, Bin
    Sun, Bo
    Guo, Hang
    Yang, Tao
    Fu, Wenxing
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2833 - 2842