A characterization of the optimal risk-sensitive average cost in finite controlled Markov chains

被引：29

作者：

Cavazos-Cadena, R ^{[1
]}

Hernández-Hernández, D

机构：

[1] Univ Autonoma Agraria Antonio Narro, Dept Estadist & Calculo, Saltillo 25315, Coahuila, Mexico

[2] Ctr Invest Matemat, Guanajuato 36000, GTO, Mexico

来源：

ANNALS OF APPLIED PROBABILITY | 2005年 / 15卷 / 1A期

关键词：

decreasing function along trajectories; stopping time; nearly optimal policies; Holder's inequality; simultaneous Doeblin condition; recurrent state;

D O I：

10.1214/105051604000000585

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

This work concerns controlled Markov chains with finite state and action spaces. The transition law satisfies the simultaneous Doeblin condition, and the performance of a control policy is measured by the (long-run) risk-sensitive average cost criterion associated to a positive, but otherwise arbitrary, risk sensitivity coefficient. Within this context, the optimal risk-sensitive average cost is characterized via a minimization problem in a finite-dimensional Euclidean space.

引用

页码：175 / 212

页数：38

共 50 条

[41] Markov risk mappings and risk-sensitive optimal prediction
Tomasz Kosmala
Randall Martyr
John Moriarty
Mathematical Methods of Operations Research, 2023, 97 : 91 - 116
[42] A VARIATIONAL CHARACTERIZATION OF THE RISK-SENSITIVE AVERAGE REWARD FOR CONTROLLED DIFFUSIONS ON Rd
Arapostathis, Ari
Biswas, Anup
Borkar, Vivek S.
Kumar, K. Suresh
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2020, 58 (06) : 3785 - 3813
[43] Risk-sensitive control of continuous time Markov chains
Ghosh, Mrinal K.
Saha, Subhamay
STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 2014, 86 (04) : 655 - 675
[44] Controlled Markov chains with exponential risk-sensitive criteria:: Modularity, structured policies and applications
Avila-Godoy, G
Fernández-Gaucherand, E
PROCEEDINGS OF THE 37TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 1998, : 778 - 783
[45] Continuous-time Markov decision processes under the risk-sensitive average cost criterion
Wei, Qingda
Chen, Xian
OPERATIONS RESEARCH LETTERS, 2016, 44 (04) : 457 - 462
[46] RISK-SENSITIVE AVERAGE MARKOV DECISION PROCESSES IN GENERAL SPACES
Chen, Xian
Wei, Qingda
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2024, 62 (04) : 2115 - 2147
[47] The vanishing discount approach in Markov chains with risk-sensitive criteria
Cavazos-Cadena, R
Fernández-Gaucherand, E
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (10) : 1800 - 1816
[48] Analysis of a risk-sensitive control problem for hidden Markov chains
Hernández-Hernández, D
Marcus, SI
Fard, PJ
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1999, 44 (05) : 1093 - 1100
[49] CENTRAL MOMENTS AND RISK-SENSITIVE OPTIMALITY IN MARKOV REWARD CHAINS
Sladky, Karel
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE: QUANTITATIVE METHODS IN ECONOMICS: MULTIPLE CRITERIA DECISION MAKING XIX, 2018, : 325 - 331
[50] Continuous-time zero-sum games for Markov chains with risk-sensitive finite-horizon cost criterion
Golui, Subrata
Pal, Chandan
STOCHASTIC ANALYSIS AND APPLICATIONS, 2022, 40 (01) : 78 - 95

← 1 2 3 4 5 →