Discrete-time zero-sum games for Markov chains with risk-sensitive average cost criterion

被引:2
|
作者
Ghosh, Mrinal K. [1 ]
Golui, Subrata [2 ]
Pal, Chandan [2 ]
Pradhan, Somnath [3 ]
机构
[1] Indian Inst Sci, Dept Math, Bangalore 560012, India
[2] Indian Inst Technol Guwahati, Dept Math, Gauhati, Assam, India
[3] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
关键词
Risk-sensitive zero-sum game; Risk-sensitive average cost criterion; History dependent strategies; Shapley equations; Value function; Saddle point equilibrium; OPTIMAL STATIONARY POLICIES; DECISION-PROCESSES; LARGE DEVIATIONS; SPECTRAL THEORY; OPTIMALITY; COUNTEREXAMPLE;
D O I
10.1016/j.spa.2022.12.009
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable/compact state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded for countable state space case and for compact state space case it is a real-valued and bounded function. For countable state space case, under a certain Lyapunov type stability assumption on the dynamics we establish the existence of the value and a saddle point equilibrium. For compact state space case we establish these results without any Lyapunov type stability assumptions. Using the stochastic representation of the principal eigenfunction of the associated optimality equation, we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. Also, we present and analyze an illustrative example.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 74
页数:35
相关论文
共 50 条
  • [21] RISK-SENSITIVE AVERAGE OPTIMALITY FOR DISCRETE-TIME MARKOV DECISION PROCESSES
    Chen, Xian
    Wei, Qingda
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2023, 61 (01) : 72 - 104
  • [22] ZERO-SUM GAMES FOR PURE JUMP PROCESSES WITH RISK-SENSITIVE DISCOUNTED COST CRITERIA
    Pal, Chandan
    Pradhan, Somnath
    [J]. JOURNAL OF DYNAMICS AND GAMES, 2022, 9 (01): : 13 - 25
  • [23] Mean-field risk sensitive control and zero-sum games for Markov chains
    Choutri, Salah Eddine
    Djehiche, Boualem
    [J]. BULLETIN DES SCIENCES MATHEMATIQUES, 2019, 152 : 1 - 39
  • [24] Zero-sum stochastic games with the average-value-at-risk criterion
    Liu, Qiuli
    Ching, Wai-Ki
    Guo, Xianping
    [J]. TOP, 2023, 31 (03) : 618 - 647
  • [25] Zero-sum stochastic games with the average-value-at-risk criterion
    Qiuli Liu
    Wai-Ki Ching
    Xianping Guo
    [J]. TOP, 2023, 31 : 618 - 647
  • [26] Zero-sum risk-sensitive stochastic games on a countable state space
    Basu, Arnab
    Ghosh, Mrinal Kanti
    [J]. STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2014, 124 (01) : 961 - 983
  • [27] Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games
    Wei, Qinglai
    Liu, Derong
    Lin, Qiao
    Song, Ruizhuo
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 957 - 969
  • [28] Stochastic Zero-Sum Differential Games and H∞ Control of Discrete-time Markov Jump Systems
    Zhou Haiying
    Zhu Huainian
    Zhang Chengke
    [J]. 26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 151 - 156
  • [29] Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates
    Guo, XP
    Hernández-Lerma, O
    [J]. JOURNAL OF APPLIED PROBABILITY, 2003, 40 (02) : 327 - 345
  • [30] Zero-sum risk-sensitive stochastic differential games with reflecting diffusions in the orthant
    Ghosh, Mrinal Kanti
    Pradhan, Somnath
    [J]. ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2020, 26