Adaptive dynamic programming for online solution of a zero-sum differential game

被引：108

作者：

Vrabie D. ^{[1
]}

Lewis F. ^{[2
]}

机构：

[1] United Technologies Research Center, East Hartford

[2] Automation and Robotics Research Institute, University of Texas at Arlington, Fort Worth

来源：

Journal of Control Theory and Applications | 2011年 / 9卷 / 03期

基金：

美国国家科学基金会;

关键词：

Approximate/Adaptive dynamic programming; Game algebraic Riccati equation; Nash equilibrium; Zero-sum differential game;

D O I：

10.1007/s11768-011-0166-4

中图分类号：

学科分类号：

摘要：

This paper will present an approximate/adaptive dynamic programming (ADP) algorithm, that uses the idea of integral reinforcement learning (IRL), to determine online the Nash equilibrium solution for the two-player zerosum differential game with linear dynamics and infinite horizon quadratic cost. The algorithm is built around an iterative method that has been developed in the control engineering community for solving the continuous-time game algebraic Riccati equation (CT-GARE), which underlies the game problem. We here show how the ADP techniques will enhance the capabilities of the offline method allowing an online solution without the requirement of complete knowledge of the system dynamics. The feasibility of the ADP scheme is demonstrated in simulation for a power system control application. The adaptation goal is the best control policy that will face in an optimal manner the highest load disturbance. © 2011 South China University of Technology, Academy of Mathematics and Systems Science, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg.

引用

页码：353 / 360

页数：7

共 50 条

[1] Adaptive dynamic programming for online solution of a zero-sum differential game
Draguna VRABIE
Frank LEWIS
Control Theory and Technology, 2011, 9 (03) : 353 - 360
[2] Robust adaptive dynamic programming for a zero-sum differential game
Yuan, Binbin
Lu, Pingli
Liu, Xiangdong
Bian, Tao
2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 2468 - 2473
[3] Adaptive Dynamic Programming Algorithm for Finding Online the Equilibrium Solution of the Two-Player Zero-Sum Differential Game
Vrabie, Draguna
Lewis, Frank
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[4] Model-free Adaptive Dynamic Programming for Online optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game
Qin, Chunbin
Zhang, Huaguang
Luo, Yanhong
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3815 - 3820
[5] Robust Zero-Sum Differential Game for Uncertain Nonlinear systems via Adaptive Dynamic Programming
Sun, Jingliang
Liu, Chunsheng
Wei, Along
2016 IEEE CHINESE GUIDANCE, NAVIGATION AND CONTROL CONFERENCE (CGNCC), 2016, : 1387 - 1392
[6] Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data
Zhu, Yuanheng
Zhao, Dongbin
Li, Xiangjun
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 714 - 725
[7] Robust adaptive dynamic programming for a three-player zero-sum differential game with unmatched uncertainties
Lu, Pingli
Liu, Xiangdong
Bian, Tao
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 2565 - 2570
[8] Stochastic Recursive Zero-Sum Differential Game and Mixed Zero-Sum Differential Game
Wei, Lifeng
Wu, Zhen
MATHEMATICAL PROBLEMS IN ENGINEERING, 2012, 2012
[9] Min-max adaptive dynamic programming for zero-sum differential games
Sarbaz, Mohammad
Sun, Wei
INTERNATIONAL JOURNAL OF CONTROL, 2024,
[10] Output feedback adaptive dynamic programming for linear differential zero-sum games
Rizvi, Syed Ali Asad
Lin, Zongli
Brown, Charles L.
AUTOMATICA, 2020, 122

← 1 2 3 4 5 →