Perfect-Information Stochastic Games with Generalized Mean-Payoff Objectives

被引:11
|
作者
Chatterjee, Krishnendu [1 ]
Doyen, Laurent [2 ,3 ]
机构
[1] IST Austria, Klosterneuburg, Austria
[2] ENS Cachan, LSV, Cachan, France
[3] CNRS, F-75700 Paris, France
关键词
Stochastic games; Markov decision processes; Meanpayoff; COMPLEXITY; DETERMINACY;
D O I
10.1145/2933575.2934513
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graph games provide the foundation for modeling and synthesizing reactive processes. In the synthesis of stochastic reactive processes, the traditional model is perfect-information stochastic games, where some transitions of the game graph are controlled by two adversarial players, and the other transitions are executed probabilistically. We consider such games where the objective is the conjunction of several quantitative objectives (specified as mean-payoff conditions), which we refer to as generalized mean-payoff objectives. The basic decision problem asks for the existence of a finite-memory strategy for a player that ensures the generalized mean-payoff objective be satisfied with a desired probability against all strategies of the opponent. A special case of the decision problem is the almost-sure problem where the desired probability is 1. Previous results presented a semi-decision procedure for e-approximations of the almost-sure problem. In this work, we show that both the almost-sure problem as well as the general basic decision problem are coNP-complete, significantly improving the previous results. Moreover, we show that in the case of 1-player stochastic games, randomized memoryless strategies are sufficient and the problem can be solved in polynomial time. In contrast, in two-player stochastic games, we show that even with randomized strategies exponential memory is required in general, and present a matching exponential upper bound. We also study the basic decision problem with infinite-memory strategies and present computational complexity results for the problem. Our results are relevant in the synthesis of stochastic reactive systems with multiple quantitative requirements.
引用
收藏
页码:247 / 256
页数:10
相关论文
共 50 条
  • [1] Perfect-Information Stochastic Mean-Payoff Parity Games
    Chatterjee, Krishnendu
    Doyen, Laurent
    Gimbert, Hugo
    Oualhadj, Youssouf
    [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES, 2014, 8412 : 210 - 225
  • [2] Stochastic Window Mean-Payoff Games
    Doyen, Laurent
    Gaba, Pranshu
    Guha, Shibashis
    [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES, PT I, FOSSACS 2024, 2024, 14574 : 34 - 54
  • [3] Generalized Mean-payoff and Energy Games
    Chatterjee, Krishnendu
    Doyen, Laurent
    Henzinger, Thomas A.
    Raskin, Jean-Francois
    [J]. IARCS ANNUAL CONFERENCE ON FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE (FSTTCS 2010), 2010, 8 : 505 - 516
  • [4] Reduction of stochastic parity to stochastic mean-payoff games
    Chatterjee, Krishnendu
    Henzinger, Thomas A.
    [J]. INFORMATION PROCESSING LETTERS, 2008, 106 (01) : 1 - 7
  • [5] Energy and Mean-Payoff Games with Imperfect Information
    Degorre, Aldric
    Doyen, Laurent
    Gentilini, Raffaella
    Raskin, Jean-Francois
    Torunczyk, Szymon
    [J]. COMPUTER SCIENCE LOGIC, 2010, 6247 : 260 - +
  • [6] SUBGAME-PERFECT EQUILIBRIA IN MEAN-PAYOFF GAMES
    Brice, Leonard
    Raskin, Jean-Francois
    van den Bogaard, Marie
    [J]. LOGICAL METHODS IN COMPUTER SCIENCE, 2023, 19 (04) : 1 - 6
  • [7] Perfect-information stochastic parity games
    Zielonka, W
    [J]. FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATION STRUCTURES, PROCEEDINGS, 2004, 2987 : 499 - 513
  • [8] Mean-Payoff Pushdown Games
    Chatterjee, Krishnendu
    Velner, Yaron
    [J]. 2012 27TH ANNUAL ACM/IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE (LICS), 2012, : 195 - 204
  • [9] Mean-payoff parity games
    Chatterjee, K
    Henzinger, TA
    Jurdzinski, M
    [J]. LICS 2005: 20th Annual IEEE Symposium on Logic in Computer Science - Proceedings, 2005, : 178 - 187
  • [10] Generalized reinforcement learning in perfect-information games
    Maxwell Pak
    Bing Xu
    [J]. International Journal of Game Theory, 2016, 45 : 985 - 1011