Discrete-time zero-sum games for Markov chains with risk-sensitive average cost criterion

被引:2
|
作者
Ghosh, Mrinal K. [1 ]
Golui, Subrata [2 ]
Pal, Chandan [2 ]
Pradhan, Somnath [3 ]
机构
[1] Indian Inst Sci, Dept Math, Bangalore 560012, India
[2] Indian Inst Technol Guwahati, Dept Math, Gauhati, Assam, India
[3] Queens Univ, Dept Math & Stat, Kingston, ON K7L 3N6, Canada
关键词
Risk-sensitive zero-sum game; Risk-sensitive average cost criterion; History dependent strategies; Shapley equations; Value function; Saddle point equilibrium; OPTIMAL STATIONARY POLICIES; DECISION-PROCESSES; LARGE DEVIATIONS; SPECTRAL THEORY; OPTIMALITY; COUNTEREXAMPLE;
D O I
10.1016/j.spa.2022.12.009
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study zero-sum stochastic games for controlled discrete time Markov chains with risk-sensitive average cost criterion with countable/compact state space and Borel action spaces. The payoff function is nonnegative and possibly unbounded for countable state space case and for compact state space case it is a real-valued and bounded function. For countable state space case, under a certain Lyapunov type stability assumption on the dynamics we establish the existence of the value and a saddle point equilibrium. For compact state space case we establish these results without any Lyapunov type stability assumptions. Using the stochastic representation of the principal eigenfunction of the associated optimality equation, we completely characterize all possible saddle point strategies in the class of stationary Markov strategies. Also, we present and analyze an illustrative example.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:40 / 74
页数:35
相关论文
共 50 条