Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引:17
|
作者
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Zhao, Ziliang [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
D O I
10.1109/TCSII.2021.3112050
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 50 条
  • [1] Stochastic Two-Player Zero-Sum Learning Differential Games
    Liu, Mushuang
    Wan, Yan
    Lewis, Frank L.
    Lopez, Victor G.
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 1038 - 1043
  • [2] Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback
    Cai, Yang
    Luo, Haipeng
    Wei, Chen-Yu
    Zheng, Weiqiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Inverse Two-Player Zero-Sum Dynamic Games
    Tsai, Dorian
    Molloy, Timothy L.
    Perez, Tristan
    2016 AUSTRALIAN CONTROL CONFERENCE (AUCC), 2016, : 192 - 196
  • [4] A theorem on symmetric two-player zero-sum games
    Laffond, G
    Laslier, JF
    LeBreton, M
    JOURNAL OF ECONOMIC THEORY, 1997, 72 (02) : 426 - 431
  • [5] Large Scale Learning of Agent Rationality in Two-Player Zero-Sum Games
    Ling, Chun Kai
    Fang, Fei
    Kolterl, J. Zico
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6104 - 6111
  • [6] Using CNN for solving two-player zero-sum games
    Wu, Dawen
    Lisser, Abdel
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [7] Multigrid methods for two-player zero-sum stochastic games
    Akian, Marianne
    Detournay, Sylvie
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2012, 19 (02) : 313 - 342
  • [8] Policy Similarity Measure for Two-Player Zero-Sum Games
    Tang, Hongsong
    Xiang, Liuyu
    He, Zhaofeng
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [9] Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
    Chen, Zixiang
    Zhou, Dongruo
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [10] Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
    Zhu, Yuanheng
    Zhao, Dongbin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) : 1228 - 1241