ARENA-INDEPENDENT FINITE-MEMORY DETERMINACY IN STOCHASTIC GAMES

被引:1
|
作者
Bouyer, Patricia [1 ]
Oualhadj, Youssouf [2 ]
Randour, Mickael [3 ,4 ]
Vandenhove, Pierre [1 ,3 ,4 ]
机构
[1] Univ Paris Saclay, CNRS, Lab Methodes Formelles, ENS Paris Saclay, F-91190 Gif Sur Yvette, France
[2] Univ Paris Est Creteil, LACL, F-94010 Creteil, France
[3] FRS FNRS, Brussels, Belgium
[4] UMONS Univ Mons, Mons, Belgium
关键词
two-player games on graphs; stochastic games; Markov decision processes; finite-memory determinacy; optimal strategies; COMPLEXITY; AUTOMATA;
D O I
10.46298/LMCS-19(4:18)2023
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in presence of an antagonistic opponent in a random environment. In this setting, an important question is the one of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games (i.e., Markov decision processes). Third, we characterize the sufficiency of AIFM strategies through two intuitive properties of objectives. This work extends a line of research started on deterministic games to stochastic ones.
引用
收藏
页码:1 / 18
页数:51
相关论文
共 50 条
  • [1] GAMES WHERE YOU CAN PLAY OPTIMALLY WITH ARENA-INDEPENDENT FINITE MEMORY
    Bouyer, Patricia
    Le Roux, Stephane
    Oualhadj, Youssouf
    Randour, Mickael
    Vandenhove, Pierre
    LOGICAL METHODS IN COMPUTER SCIENCE, 2022, 18 (01)
  • [2] Arena-Independent Memory Bounds for Nash Equilibria in Reachability Games
    Main, James C. A.
    41ST INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE, STACS 2024, 2024, 289
  • [3] Extending finite-memory determinacy to multi-player games
    Le Roux, Stephane
    Pauly, Arno
    INFORMATION AND COMPUTATION, 2018, 261 : 676 - 694
  • [4] The Complexity of Partial-Observation Stochastic Parity Games with Finite-Memory Strategies
    Chatterjee, Krishnendu
    Doyen, Laurent
    Nain, Sumit
    Vardi, Moshe Y.
    FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES, 2014, 8412 : 242 - 257
  • [5] Extending Finite Memory Determinacy to Multiplayer Games
    Le Roux, Stephane
    Pauly, Arno
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (218): : 27 - 40
  • [6] Agreement dynamics of finite-memory language games on networks
    W. X. Wang
    B. Y. Lin
    C. L. Tang
    G. R. Chen
    The European Physical Journal B, 2007, 60 : 529 - 536
  • [7] Agreement dynamics of finite-memory language games on networks
    Wang, W. X.
    Lin, B. Y.
    Tang, C. L.
    Chen, G. R.
    EUROPEAN PHYSICAL JOURNAL B, 2007, 60 (04): : 529 - 536
  • [8] Finite-Memory Systems
    Maria Alessandra Fasoli
    Multidimensional Systems and Signal Processing, 1998, 9 : 291 - 306
  • [9] Finite-memory systems
    Universitat Innsbruck, Innsbruck, Austria
    Multidimens Syst Signal Proc, 3 (291-306):
  • [10] Finite-memory systems
    Fasoli, MA
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 1998, 9 (03) : 291 - 306