H∞ Control for Discrete-time Linear Systems by Integrating Off-policy Q-learning and Zero-sum Game

被引:0
|
作者
Li, Jinna [1 ]
Ding, Zhengtao [2 ]
Yang, Chunyu [3 ]
Niu, Hong [4 ]
机构
[1] Liaoning Shihua Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China
[2] Univ Manchester, Sch Elect & Elect Engn, Manchester M13 9PL, Lancs, England
[3] China Univ Min & Technol, Sch Informat & Elect Engn, Xuzhou 221116, Jiangsu, Peoples R China
[4] Liaoning Shihua Univ, Coll Sci, Fushun 113001, Peoples R China
关键词
ITERATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on solving H infinity control problem by proposing a novel off-policy Q-learning algorithm under the framework of zero-sum game for discrete-time (DT) linear systems, using only the measured data along the system trajectories. Firstly, H infinity control problem is formulated, followed by the standard on-policy Q-learning algorithm in order to learn the H infinity controller gain. Secondly, behavior control policy and behavior disturbance policy are introduced, and an off-policy Q-function based game Bellman equation (OPQ-GBE) is derived. Consequently, an off-policy Q-learning algorithm is developed for the first time for discrete-time linear systems subject to the external disturbance, and the convergence and no bias of solution of OPQ-BE are proven. Finally, a F-16 aircraft autopilot is given to verify the effectiveness of the proposed method.
引用
收藏
页码:817 / 822
页数:6
相关论文
共 50 条
  • [1] Zero-sum game-based optimal control for discrete-time Markov jump systems: A parallel off-policy Q-learning method
    Wang, Yun
    Fang, Tian
    Kong, Qingkai
    Li, Feng
    APPLIED MATHEMATICS AND COMPUTATION, 2024, 467
  • [2] Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems
    Luo, Biao
    Yang, Yin
    Liu, Derong
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (07) : 3630 - 3640
  • [3] H∞ Control for Discrete-Time Multi-Player Systems via Off-Policy Q-Learning
    Li, Jinna
    Xiao, Zhenfei
    IEEE ACCESS, 2020, 8 (08): : 28831 - 28846
  • [4] H∞ control of linear discrete-time systems: Off-policy reinforcement learning
    Kiumarsi, Bahare
    Lewis, Frank L.
    Jiang, Zhong-Ping
    AUTOMATICA, 2017, 78 : 144 - 152
  • [5] Seeking Nash Equilibrium for Linear Discrete-time Systems via Off-policy Q-learning
    Ni, Haohan
    Ji, Yuxiang
    Yang, Yuxiao
    Zhou, Jianping
    IAENG International Journal of Applied Mathematics, 2024, 54 (11) : 2477 - 2483
  • [6] Off-policy inverse Q-learning for discrete-time antagonistic unknown systems
    Lian, Bosen
    Xue, Wenqian
    Xie, Yijing
    Lewis, Frank L.
    Davoudi, Ali
    AUTOMATICA, 2023, 155
  • [7] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
    Rizvi, Syed Ali Asad
    Lin, Zongli
    AUTOMATICA, 2018, 95 : 213 - 221
  • [8] Off-Policy Interleaved Q-Learning: Optimal Control for Affine Nonlinear Discrete-Time Systems
    Li, Jinna
    Chai, Tianyou
    Lewis, Frank L.
    Ding, Zhengtao
    Jiang, Yi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1308 - 1320
  • [9] Finite-horizon Q-learning for discrete-time zero-sum games with application to H∞$$ {H}_{\infty } $$ control
    Liu, Mingxiang
    Cai, Qianqian
    Meng, Wei
    Li, Dandan
    Fu, Minyue
    ASIAN JOURNAL OF CONTROL, 2023, 25 (04) : 3160 - 3168
  • [10] H∞ Tracking Control of Unknown Discrete-Time Linear Systems via Output-Data-Driven Off-policy Q-learning Algorithm
    Zhang, Kun
    Liu, Xuantong
    Zhang, Lei
    Chen, Qian
    Peng, Yunjian
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2350 - 2356