Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

Cited by: 17
Authors
Zhao, Jun [1]
Lv, Yongfeng [2]
Zhao, Ziliang [3]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
DOI
10.1109/TCSII.2021.3112050
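Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract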
Although optimal control with full state feedback has been well studied, solving the output-feedback optimal control problem online remains difficult, in particular when the Nash equilibrium solution of continuous-time (CT) two-player zero-sum differential games must be learned online. We propose an adaptive learning algorithm to address this difficult problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback counterpart. An adaptive online learning method is then proposed to approximate the solution of the MGARE from online data, where two operations (vectorization and the Kronecker product) are used to reconstruct the MGARE. Only system output information is needed to implement the developed learning algorithm. Simulation results illustrate the proposed control and learning method.
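The abstract names vectorization and the Kronecker product as the two operations used to reconstruct the MGARE. This record does not reproduce the MGARE itself, so the sketch below only illustrates the underlying identity vec(AXB) = (B^T ⊗ A) vec(X) on a stand-in problem: the Lyapunov equation A^T P + P A + Q = 0. The function name, the matrices A and Q, and the choice of a Lyapunov equation are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def solve_lyapunov_by_vectorization(A, Q):
    """Solve A^T P + P A + Q = 0 for P via vec() and the Kronecker product.

    Hypothetical helper for illustration only; not the paper's MGARE solver.
    """
    n = A.shape[0]
    I = np.eye(n)
    # With column-stacking vec(): vec(A^T P) = (I kron A^T) vec(P)
    # and vec(P A) = (A^T kron I) vec(P).
    M = np.kron(I, A.T) + np.kron(A.T, I)
    # The matrix equation becomes the linear system M vec(P) = -vec(Q).
    vec_P = np.linalg.solve(M, -Q.flatten(order="F"))
    return vec_P.reshape((n, n), order="F")


# Assumed example data: a stable 2x2 system matrix and identity weight.
A = np.array([[0.0, 1.0], [-2.0, -3.0]])
Q = np.eye(2)

P = solve_lyapunov_by_vectorization(A, Q)
residual = A.T @ P + P @ A + Q
print(np.max(np.abs(residual)))  # should be close to zero
```

Rewriting the matrix equation as a linear system in vec(P) is what makes the unknown accessible to standard least-squares or adaptive parameter-estimation updates; how the paper applies this to output data is described in the full text, not here.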
Pages: 1437-1441
Page count: 5
Related papers
50 in total
  • [41] Improved saddle point prediction in stochastic two-player zero-sum games with a deep learning approach
    Wu, Dawen
    Lisser, Abdel
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [42] Decentralized Learning in Two-Player Zero-Sum Games: A LR-I Lagging Anchor Algorithm
    Lu, Xiaosong
    Schwartz, Howard M.
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 107 - 112
  • [43] A numerical optimization pesudo-algorithm for two-player zero-sum stochastic games
    Li, Peng
    Li, Xiangrong
    Yuan, Gonglin
    Zhang, Maojun
    APPLIED ECONOMICS, 2021, 53 (15) : 1729 - 1742
  • [44] Model-free finite-horizon optimal control of discrete-time two-player zero-sum games
    Wang, Wei
    Chen, Xin
    Du, Jianhua
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (01) : 167 - 179
  • [45] Sufficient Conditions for Optimality and Asymptotic Stability in Two-Player Zero-Sum Hybrid Games
    Leudo, Santiago J.
    Sanfelice, Ricardo G.
    HSCC 2022: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK 2022), 2022,
  • [46] Feature Selection Scheme Based on Zero-Sum Two-Player Game
    Touazi, Azzedine
    Mokdad, Fatiha
    Bouchaffra, Djamel
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1342 - 1347
  • [47] A dynamical neural network approach for solving stochastic two-player zero-sum games
    Wu, Dawen
    Lisser, Abdel
    NEURAL NETWORKS, 2022, 152 : 140 - 149
  • [48] Online solution of two-player zero-sum games for linear systems with unknown dynamics
    Fu, Yue
    Chai, Tian-You
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2015, 32 (02): : 196 - 201
  • [49] Solutions for zero-sum two-player games with noncompact decision sets and unbounded payoffs
    Feinberg, Eugene A.
    Kasyanov, Pavlo O.
    Zgurovsky, Michael Z.
    NAVAL RESEARCH LOGISTICS, 2023, 70 (05) : 493 - 506
  • [50] Sufficient Conditions for Optimality and Asymptotic Stability in Two-Player Zero-Sum Hybrid Games
    Jimenez Leudo, Santiago
    Sanfelice, Ricardo G
    HSCC 2022 - Proceedings of the 25th ACM International Conference on Hybrid Systems: Computation and Control, Part of CPS-IoT Week 2022, 2022,