DISCOUNTING AND AVERAGING IN GAMES ACROSS TIME SCALES

Cited by: 0

Authors:

Chatterjee, Krishnendu [1]

Majumdar, Rupak [1]

Affiliation:

[1] Univ Calif Los Angeles, Los Angeles, CA USA

Source:

INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE | 2012 / Vol. 23 / No. 03

Keywords:

Games on graphs; discounted objectives; mean-payoff objectives

DOI:

10.1142/S0129054112400308

Chinese Library Classification (CLC) number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

We introduce two-level discounted and mean-payoff games played by two players on a perfect-information stochastic game graph. The upper-level game is a discounted or mean-payoff game, and the lower-level game is an (undiscounted) reachability game. Two-level games model hierarchical and sequential decision making under uncertainty across different time scales. For both discounted and mean-payoff two-level games, we show the existence of pure memoryless optimal strategies for both players and an ordered field property. We show that if there is only one player (Markov decision processes), then the values can be computed in polynomial time. It follows that whether the value of a player is equal to a given rational constant in two-level discounted or mean-payoff games can be decided in NP ∩ coNP. We also give an alternate strategy improvement algorithm to compute the value.


Pages: 609 - 625

Number of pages: 17

Related records: 50 in total

[31] DISCOUNTING OF DEVIANT INFORMATION IN A NUMBER AVERAGING TASK
LEVIN, IP
GIBBS, CM
BULLETIN OF THE PSYCHONOMIC SOCIETY, 1974, 4 (NA4) : 242 - 242
[32] Metabolic organization across scales of space and time
Lavaud, Romain (rlavaud@agcenter.lsu.edu), 2025, 500
[33] Imaging malaria parasites across scales and time
Guizetti, Julien
Journal of Microscopy,
[34] Epitaxial phenomena across length and time scales
Vvedensky, DD
SURFACE AND INTERFACE ANALYSIS, 2001, 31 (07) : 627 - 636
[35] Causality and Information Transfer Across Time Scales
Palus, Milan
2019 PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON MEASUREMENT (MEASUREMENT 2019), 2019, : 92 - 101
[36] Stability of graph communities across time scales
Delvenne, J. -C.
Yaliraki, S. N.
Barahona, M.
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (29) : 12755 - 12760
[37] On LP Formulations of Optimal Control Problems with Time Averaging and Time Discounting Criteria in Non-Ergodic Case
Borkar, Vivek
Gaitsgory, Vladimir
Shvartsman, Ilya
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4298 - 4303
[38] Strategic risk and response time across games
Pablo Brañas-Garza
Debrah Meloso
Luis Miller
International Journal of Game Theory, 2017, 46 : 511 - 523
[39] Strategic risk and response time across games
Branas-Garza, Pablo
Meloso, Debrah
Miller, Luis
INTERNATIONAL JOURNAL OF GAME THEORY, 2017, 46 (02) : 511 - 523
[40] Exponential discounting in security games of timing
Merlevede, Jonathan
Johnson, Benjamin
Grossklags, Jens
Holvoet, Tom
JOURNAL OF CYBERSECURITY, 2021, 7 (01):
