Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引:17
|
作者
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Zhao, Ziliang [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
D O I
10.1109/TCSII.2021.3112050
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 50 条
  • [31] Upper bounds and Cost Evaluation in Dynamic Two-player Zero-sum Games
    Leudo, Santiago J.
    Ferrante, Francesco
    Sanfelice, Ricardo G.
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 424 - 429
  • [32] Corruption-Robust Offline Two-Player Zero-Sum Markov Games
    Nika, Andi
    Mandal, Debmalya
    Singla, Adish
    Radanovic, Goran
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [33] Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games
    Zeng, Sihan
    Doan, Thinh
    Romberg, Justin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [34] Equilibrium payoffs in repeated two-player zero-sum games of finite automata
    O. V. Baskov
    International Journal of Game Theory, 2019, 48 : 423 - 431
  • [35] Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information
    Wiggers, Auke J.
    Oliehoek, Frans A.
    Roijers, Diederik M.
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1628 - 1629
  • [36] GPI-Based design for partially unknown nonlinear two-player zero-sum games
    Yu, Lin
    Xiong, Junlin
    Xie, Min
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (03): : 2068 - 2088
  • [37] Two-player nonzero-sum and zero-sum games subject to stochastic noncausal systems
    Chen, Xin
    Zhang, Zeyu
    Zhang, Yijia
    Yuan, Dongmei
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [38] Optimal Control of Two-Player Systems With Output Feedback
    Lessard, Laurent
    Lall, Sanjay
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (08) : 2129 - 2144
  • [39] Adversarial Learning for Safe Highway Driving based on Two-Player Zero-Sum Game
    Li, Fangjian
    Zhao, Mengtao
    Wagner, John
    Wang, Yue
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 472 - 477
  • [40] Reinforcement Learning Based Solution to Two-player Zero-sum Game Using Differentiator
    Guo, Xinxin
    Yan, Weisheng
    Cui, Peng
    Zhang, Shouxu
    2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 708 - 713