Games on graphs;
discounted objectives;
mean-payoff objectives;
DOI:
10.1142/S0129054112400308
Chinese Library Classification number:
TP301 [Theory and methods];
Subject classification code:
081202;
Abstract:
We introduce two-level discounted and mean-payoff games played by two players on a perfect-information stochastic game graph. The upper-level game is a discounted or mean-payoff game, and the lower-level game is an (undiscounted) reachability game. Two-level games model hierarchical and sequential decision making under uncertainty across different time scales. For both discounted and mean-payoff two-level games, we show the existence of pure memoryless optimal strategies for both players and an ordered field property. We show that if there is only one player (Markov decision processes), then the values can be computed in polynomial time. It follows that whether the value of a player is equal to a given rational constant in two-level discounted or mean-payoff games can be decided in NP ∩ coNP. We also give an alternate strategy improvement algorithm to compute the value.
Affiliation:
Univ Denis Diderot Paris 7, LIAFA, Case 7014, 2 Pl Jussieu, F-75251 Paris 05, France
Gimbert, Hugo
Zielonka, Wieslaw