DISCOUNTING AND AVERAGING IN GAMES ACROSS TIME SCALES

被引：0

作者：

Chatterjee, Krishnendu ^{[1
]}

Majumdar, Rupak ^{[1
]}

机构：

[1] Univ Calif Los Angeles, Los Angeles, CA USA

来源：

INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE | 2012年 / 23卷 / 03期

关键词：

Games on graphs; discounted objectives; mean-payoff objectives;

D O I：

10.1142/S0129054112400308

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We introduce two-level discounted and mean-payoff games played by two players on a perfect-information stochastic game graph. The upper level game is a discounted or mean-payoff game and the lower level game is a (undiscounted) reachability game. Two-l evel games model hierarchical and sequential decision making under uncertainty across different time scales. For both discounted and mean-payoff two-level games, we show the existence of pure memoryless optimal strategies for both players and an ordered field property. We show that if there is only one player (Markov decision processes), then the values can be computed in polynomial time. It follows that whether the value of a player is equal to a given rational constant in two-level discounted or mean-payoff games can be decided in NP boolean AND coNP. We also give an alternate strategy improvement algorithm to compute the value.

引用

页码：609 / 625

页数：17

共 50 条

[21] EXISTENCE OF PERIODIC SOLUTIONS OF DYNAMIC EQUATIONS ON TIME SCALES BY AVERAGING
Guo, Ruichao
Li, Yong
Xing, Jiamin
Yang, Xue
DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES S, 2017, 10 (05): : 959 - 971
[22] Hyperbolic Discounting with Environmental Outcomes across Time, Space, and Probability
Sargisson, Rebecca J.
Schoner, Benedikt, V
PSYCHOLOGICAL RECORD, 2020, 70 (03): : 515 - 527
[23] Repeated games with stochastic discounting
Baye, MR
Jansen, DW
ECONOMICA, 1996, 63 (252) : 531 - 541
[24] Hyperbolic Discounting with Environmental Outcomes across Time, Space, and Probability
Rebecca J. Sargisson
Benedikt V. Schöner
The Psychological Record, 2020, 70 : 515 - 527
[25] Repeated games with general discounting
Obara, Ichiro
Park, Jaeok
JOURNAL OF ECONOMIC THEORY, 2017, 172 : 348 - 375
[26] Reputation in repeated games with no discounting
Watson, J
GAMES AND ECONOMIC BEHAVIOR, 1996, 15 (01) : 82 - 109
[27] Evolutionary Games with Different Time Scales of Strategy Updating
Zhang, Jianlei
Zhu, Yuying
Zhang, Chunyan
2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 3775 - 3780
[28] DELAY DISCOUNTING ACROSS TIME: MENTAL TIME TRAVEL AND MONETARY DELAY DISCOUNTING FROM THE FUTURE AND PAST IN HEAVY ALCOHOL USERS
Moody, L. N.
Bickel, W. K.
ALCOHOLISM-CLINICAL AND EXPERIMENTAL RESEARCH, 2015, 39 : 38A - 38A
[29] An application of squared prediction errors of time domain averaging across all scales on gearbox fault detection based on vibration signals
Ferreira, R. J. P.
Wang, W.
Zuo, M. J.
Almeida, A. T.
RELIABILITY, RISK AND SAFETY: THEORY AND APPLICATIONS VOLS 1-3, 2010, : 179 - +
[30] DISCOUNTING VERSUS AVERAGING IN DYNAMIC-PROGRAMMING
LEHRER, E
MONDERER, D
GAMES AND ECONOMIC BEHAVIOR, 1994, 6 (01) : 97 - 113

← 1 2 3 4 5 →