Robust Control of Unknown Observable Nonlinear Systems Solved as a Zero-Sum Game

被引:27
|
作者
Radac, Mircea-Bogdan [1 ]
Lala, Timotei [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300223, Romania
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Mathematical model; Robust control; Games; Optimal control; Linear systems; Game theory; Roads; Active suspension system; approximate dynamic programming; neural networks; optimal control; reinforcement learning; state feedback; zero-sum two-player games; STATE-FEEDBACK CONTROL; DISCRETE-TIME-SYSTEMS; H-INFINITY CONTROL; VEHICLE SUSPENSION; LEARNING ALGORITHM; DESIGN;
D O I
10.1109/ACCESS.2020.3040185
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimal robust control solution for general nonlinear systems with unknown but observable dynamics is advanced here. The underlying Hamilton-Jacobi-Isaacs (HJI) equation of the corresponding zero-sum two-player game (ZS-TP-G) is learned using a Q-learning-based approach employing only input-output system measurements, assuming system observability. An equivalent virtual state-space model is built from the system's input-output samples and it is shown that controlling the former implies controlling the latter. Since the existence of a saddle-point solution to the ZS-TP-G is assumed unverifiable, the solution is derived in terms of upper-optimal and lower-optimal controllers. The learning convergence is theoretically ensured while practical implementation is performed using neural networks that provide scalability to the control problem dimension and automatic feature selection. The learning strategy is checked on an active suspension system, a good candidate for the robust control problem with respect to road profile disturbance rejection.
引用
收藏
页码:214153 / 214165
页数:13
相关论文
共 50 条
  • [21] Robust adaptive dynamic programming for a zero-sum differential game
    Yuan, Binbin
    Lu, Pingli
    Liu, Xiangdong
    Bian, Tao
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 2468 - 2473
  • [22] NAFTA IS NO ZERO-SUM GAME
    PALMER, MM
    FORTUNE, 1993, 128 (01) : 30 - 30
  • [23] Is Recognition a Zero-Sum Game?
    Shain, Ralph
    TELOS, 2008, (143): : 63 - 87
  • [24] Science is Not a Zero-Sum Game
    Mani, Devendra
    Zare, Richard
    RESONANCE-JOURNAL OF SCIENCE EDUCATION, 2014, 19 (05): : 471 - 477
  • [25] Growth Is Not a Zero-Sum Game
    Burrell, Lisa
    MIT SLOAN MANAGEMENT REVIEW, 2019, 60 (03) : 1 - 1
  • [26] Robust containment control of multi-agent networks based on zero-sum game
    Yu D.
    Kongzhi yu Juece/Control and Decision, 2021, 36 (08): : 1841 - 1848
  • [27] Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data
    Zhu, Yuanheng
    Zhao, Dongbin
    Li, Xiangjun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 714 - 725
  • [28] Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems
    Xue, Shan
    Luo, Biao
    Liu, Derong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09): : 3189 - 3199
  • [29] NEURAL NETWORK OPTIMAL CONTROL FOR NONLINEAR SYSTEM BASED ON ZERO-SUM DIFFERENTIAL GAME
    Fu Xingjian
    Li Zizheng
    KYBERNETIKA, 2021, 57 (03) : 546 - 566
  • [30] Zero-Sum Differential Game-Based Fault-Tolerant Control for a Class of Affine Nonlinear Systems
    Ren, Hao
    Jiang, Bin
    Ma, Yajie
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) : 1272 - 1282