Gradient-Based Algorithms With Intermediate Observations in Static and Differential Games

Citations: 0
Authors
Hossain, Mohammad Safayet [1]
Simaan, Marwan A. [1]
Qu, Zhihua [1]
Affiliations
[1] Univ Cent Florida, Dept Elect & Comp Engn, Orlando, FL 32816 USA
Source
IEEE ACCESS | 2025 / Vol. 13
Keywords
Nash equilibrium; Static games; differential games; gradient-based minimization algorithms; NASH EQUILIBRIUM SEEKING; NUMERICAL-METHODS;
DOI
10.1109/ACCESS.2024.3523258
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In two-player static and differential games, strategic players often use available or delayed information about the other player's decisions and solve an optimization or optimal control problem to determine their strategic choices. Without this information, a player's ability to determine its optimal decisions becomes problematic. In this paper, we propose an approach in which each player implements an iterative discrete-time gradient-based algorithm that relies only on intermediate observations, either current or prior, of the other player's actions. We explore the implementation of such gradient play algorithms in the case of non-zero-sum static games and in the more complex case of differential games. We discuss the properties of these algorithms with heterogeneous stepsizes and derive explicit necessary and sufficient conditions, on the game parameters in the objective functions and on the stepsizes, that guarantee convergence to the Nash equilibrium in static games with quadratic objective functions. Examples in both static and differential games are presented to illustrate the results.
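The gradient play idea in the abstract can be illustrated with a minimal sketch for a two-player static game with quadratic objectives. It is not taken from the paper: the objective form J1 = 0.5*a1*u1^2 + b1*u1*u2 + c1*u1 (and symmetrically for J2), the function names, and all numeric values are assumptions chosen for illustration. Each player takes a gradient step on its own objective using the other player's most recently observed action, with its own stepsize (s1, s2).

```python
def gradient_play(a1, b1, c1, a2, b2, c2, s1, s2, u1=0.0, u2=0.0, steps=500):
    """Simultaneous gradient play with heterogeneous stepsizes s1, s2.

    Assumed (hypothetical) objectives:
      J1(u1, u2) = 0.5*a1*u1**2 + b1*u1*u2 + c1*u1
      J2(u1, u2) = 0.5*a2*u2**2 + b2*u1*u2 + c2*u2
    """
    for _ in range(steps):
        # Each player differentiates its own objective w.r.t. its own action,
        # plugging in the currently observed action of the other player.
        g1 = a1 * u1 + b1 * u2 + c1
        g2 = a2 * u2 + b2 * u1 + c2
        u1, u2 = u1 - s1 * g1, u2 - s2 * g2
    return u1, u2


def nash_equilibrium(a1, b1, c1, a2, b2, c2):
    """Solve the first-order conditions g1 = 0, g2 = 0 by Cramer's rule."""
    det = a1 * a2 - b1 * b2
    u1 = (-c1 * a2 + b1 * c2) / det
    u2 = (-a1 * c2 + b2 * c1) / det
    return u1, u2


# Illustrative parameters (assumed, not from the paper).
params = dict(a1=2.0, b1=1.0, c1=-4.0, a2=3.0, b2=1.0, c2=-6.0)
u_iter = gradient_play(**params, s1=0.1, s2=0.2)
u_star = nash_equilibrium(**params)
print(u_iter, u_star)
```

For these assumed values the iterates approach the solution of the first-order conditions. Whether the iteration converges in general depends on the game parameters and the stepsizes; the paper's exact conditions are not reproduced here.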
Pages: 2694 - 2704
Number of pages: 11