Adaptive dynamic programming for online solution of a zero-sum differential game

被引:109
|
作者
Vrabie D. [1 ]
Lewis F. [2 ]
机构
[1] United Technologies Research Center, East Hartford
[2] Automation and Robotics Research Institute, University of Texas at Arlington, Fort Worth
来源
基金
美国国家科学基金会;
关键词
Approximate/Adaptive dynamic programming; Game algebraic Riccati equation; Nash equilibrium; Zero-sum differential game;
D O I
10.1007/s11768-011-0166-4
中图分类号
学科分类号
摘要
This paper will present an approximate/adaptive dynamic programming (ADP) algorithm, that uses the idea of integral reinforcement learning (IRL), to determine online the Nash equilibrium solution for the two-player zerosum differential game with linear dynamics and infinite horizon quadratic cost. The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation (CT-GARE), which underlies the game problem. We here show how the ADP techniques will enhance the capabilities of the offline method allowing an online solution without the requirement of complete knowledge of the system dynamics. The feasibility of the ADP scheme is demonstrated in simulation for a power system control application. The adaptation goal is the best control policy that will face in an optimal manner the highest load disturbance. © 2011 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:353 / 360
页数:7
相关论文
共 50 条
  • [1] Adaptive dynamic programming for online solution of a zero-sum differential game
    Draguna VRABIE
    Frank LEWIS
    [J]. Control Theory and Technology, 2011, 9 (03) : 353 - 360
  • [2] Robust adaptive dynamic programming for a zero-sum differential game
    Yuan, Binbin
    Lu, Pingli
    Liu, Xiangdong
    Bian, Tao
    [J]. 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 2468 - 2473
  • [3] Adaptive Dynamic Programming Algorithm for Finding Online the Equilibrium Solution of the Two-Player Zero-Sum Differential Game
    Vrabie, Draguna
    Lewis, Frank
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [4] Model-free Adaptive Dynamic Programming for Online optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game
    Qin, Chunbin
    Zhang, Huaguang
    Luo, Yanhong
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3815 - 3820
  • [5] Robust Zero-Sum Differential Game for Uncertain Nonlinear systems via Adaptive Dynamic Programming
    Sun, Jingliang
    Liu, Chunsheng
    Wei, Along
    [J]. 2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 1387 - 1392
  • [6] Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data
    Zhu, Yuanheng
    Zhao, Dongbin
    Li, Xiangjun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 714 - 725
  • [7] Robust adaptive dynamic programming for a three-player zero-sum differential game with unmatched uncertainties
    Lu, Pingli
    Liu, Xiangdong
    Bian, Tao
    [J]. PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 2565 - 2570
  • [8] Stochastic Recursive Zero-Sum Differential Game and Mixed Zero-Sum Differential Game
    Wei, Lifeng
    Wu, Zhen
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2012, 2012
  • [9] Output feedback adaptive dynamic programming for linear differential zero-sum games
    Rizvi, Syed Ali Asad
    Lin, Zongli
    Brown, Charles L.
    [J]. AUTOMATICA, 2020, 122
  • [10] Min-max adaptive dynamic programming for zero-sum differential games
    Sarbaz, Mohammad
    Sun, Wei
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2024,