Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Cited by: 0

Author:

Rolando Cavazos-Cadena

Affiliation:

[1] Universidad Autónoma Agraria Antonio Narro, Departamento de Estadística y Cálculo

Source:

Mathematical Methods of Operations Research | 2010 / Vol. 71

Keywords:

First arrival time; Stopping problem with total cost index; Relative value function; Constant average cost; Stochastic matrix associated with a multiplicative Poisson equation; 93E20; 60J05; 93C55;

DOI:

Not available

Abstract:

This note concerns controlled Markov chains on a denumerable state space. The performance of a control policy is measured by the risk-sensitive average criterion, and it is assumed that (a) the simultaneous Doeblin condition holds, and (b) the system is communicating under the action of each stationary policy. If the cost function is bounded below, it is established that the optimal average cost is characterized by an optimality inequality, and it is shown that, even for bounded costs, such an inequality may be strict at every state. Also, for a nonnegative cost function with compact support, the existence and uniqueness of bounded solutions of the optimality equation is proved, and an example is provided to show that such a conclusion generally fails when the cost is negative at some state.
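The abstract states no formulas, so the following is only an illustrative sketch of the risk-sensitive average criterion in a much simpler setting than the paper's: a finite, irreducible, aperiodic chain under one fixed stationary policy, with risk-sensitivity parameter lam > 0. In that finite case the risk-sensitive average cost equals (1/lam) times the logarithm of the largest eigenvalue of the matrix Q[x][y] = exp(lam * cost[x]) * P[x][y], which is the eigenvalue form of the multiplicative Poisson equation. The function name, the parameter lam, and the example chain are assumptions made for illustration and do not come from the paper.

```python
import math

def risk_sensitive_average_cost(P, cost, lam, iters=2000):
    """Estimate (1/lam) * log(rho(Q)), where Q[x][y] = exp(lam*cost[x]) * P[x][y].

    Assumes (for this sketch only) a finite, irreducible, aperiodic chain
    with transition matrix P, a fixed stationary policy, and lam > 0.
    """
    n = len(P)
    Q = [[math.exp(lam * cost[x]) * P[x][y] for y in range(n)] for x in range(n)]
    v = [1.0] * n          # current eigenvector estimate, scaled so max(v) == 1
    log_rho = 0.0
    for _ in range(iters):
        # One power-iteration step: w = Q v
        w = [sum(Q[x][y] * v[y] for y in range(n)) for x in range(n)]
        norm = max(w)      # converges to the largest eigenvalue of Q
        log_rho = math.log(norm)
        v = [wi / norm for wi in w]
    return log_rho / lam

# Hypothetical two-state chain with cost 0 in state 0 and cost 1 in state 1
P = [[0.5, 0.5], [0.2, 0.8]]
g = risk_sensitive_average_cost(P, [0.0, 1.0], lam=1.0)
```

For this chain the stationary distribution is (2/7, 5/7), so the risk-neutral average cost is 5/7; for lam > 0 the risk-sensitive value g is at least that large and cannot exceed the maximum one-step cost of 1.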

Citation

Pages: 47-84

Page count: 37


[1] Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains
Cavazos-Cadena, Rolando
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2010, 71 (01) : 47 - 84
[2] Risk-Sensitive Average Optimality in Markov Decision Chains
Sladky, Karel
Montes-de-Oca, Raul
[J]. OPERATIONS RESEARCH PROCEEDINGS 2007, 2008, : 69 - +
[3] Growth rates and average optimality in risk-sensitive Markov decision chains
Sladky, Karel
[J]. KYBERNETIKA, 2008, 44 (02) : 205 - 226
[4] Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
Rolando Cavazos-Cadena
Emmanuel Fernández-Gaucherand
[J]. Mathematical Methods of Operations Research, 1999, 49 (2) : 299 - 324
[5] Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
Cavazos-Cadena, R
Fernández-Gaucherand, E
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1999, 49 (02) : 299 - 324
[6] Risk-Sensitive and Average Optimality in Markov Decision Processes
Sladky, Karel
[J]. PROCEEDINGS OF 30TH INTERNATIONAL CONFERENCE MATHEMATICAL METHODS IN ECONOMICS, PTS I AND II, 2012, : 799 - 804
[7] RISK-SENSITIVE AVERAGE OPTIMALITY IN MARKOV DECISION PROCESSES
Sladky, Karel
[J]. KYBERNETIKA, 2018, 54 (06) : 1218 - 1230
[8] Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria
Rolando Cavazos-Cadena
[J]. Mathematical Methods of Operations Research, 2009, 70 : 541 - 566
[9] Solutions of the average cost optimality equation for finite Markov decision chains: risk-sensitive and risk-neutral criteria
Cavazos-Cadena, Rolando
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2009, 70 (03) : 541 - 566
[10] Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains
Cavazos-Cadena, Rolando
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2018, 43 (03) : 1025 - 1050
