Average criteria in denumerable semi-Markov decision chains under risk-aversion

Cited by: 0

Authors:

Cavazos-Cadena, Rolando ^{[1]}

Cruz-Suarez, Hugo ^{[2]}

Montes-De-Oca, Raul ^{[3]}

Affiliations:

[1] Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Blvd Antonio Narro 1923, Saltillo 25315, Coah, Mexico

[2] Benemerita Univ Autonoma Puebla, Fac Ciencias Fisicomatemat, Ave San Claudio & Rio Verde, Puebla 72570, Pue, Mexico

[3] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave Ferrocaril San Rafael Atlixco 186,Col Leyes Re, Cdmx 09310, Mexico

Source:

DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS | 2023 / Vol. 33 / No. 03

Keywords:

Exponential utility function; Certainty equivalent; Total relative cost; Verification theorem; Cost structure with bounded support; INFINITE-HORIZON RISK; SENSITIVE CONTROL; OPTIMALITY; COST; SYSTEM;

DOI:

10.1007/s10626-023-00376-w

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This note concerns semi-Markov decision chains evolving on a denumerable state space. The system is directed by a risk-averse controller with constant risk-sensitivity, and the performance of a decision policy is measured by a long-run average criterion associated with bounded holding cost rates and a bounded one-step cost function. Under mild conditions on the sojourn times and the transition law, restrictions on the cost structure are given to ensure that the optimal average cost can be characterized via a bounded solution of the optimality equation. This result is then used to establish a general characterization of the optimal average cost in terms of an optimality inequality from which an optimal stationary policy can be derived.


Pages: 221-256

Page count: 36

50 entries in total

[21] Semi-markov decision processes nonstandard criteria
Baykal-Guersoy, M.
Guersoy, K.
PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2007, 21 (04) : 635 - 657
[22] Finite horizon partially observable semi-Markov decision processes under risk probability criteria
Wen, Xin
Guo, Xianping
Xia, Li
Operations Research Letters, 2024, 57
[23] Constrained semi-markov decision processes with average rewards
Feinberg, E.A.
ZOR. Zeitschrift Fuer Operations Research, 1994, 40 (03):
[24] RECURRENCE CONDITIONS FOR AVERAGE AND BLACKWELL OPTIMALITY IN DENUMERABLE STATE MARKOV DECISION CHAINS
DEKKER, R
HORDIJK, A
MATHEMATICS OF OPERATIONS RESEARCH, 1992, 17 (02) : 271 - 289
[25] Using Semi-Markov Chains to Solve Semi-Markov Processes
Wu, Bei
Maya, Brenda Ivette Garcia
Limnios, Nikolaos
METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2021, 23 (04) : 1419 - 1431
[26] Using Semi-Markov Chains to Solve Semi-Markov Processes
Bei Wu
Brenda Ivette Garcia Maya
Nikolaos Limnios
Methodology and Computing in Applied Probability, 2021, 23 : 1419 - 1431
[27] Constrained semi-Markov decision processes with ratio and time expected average criteria in Polish spaces
Wei, Qingda
Guo, Xianping
OPTIMIZATION, 2015, 64 (07) : 1593 - 1623
[28] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
Yong-hui Huang
Guo Xian-ping
Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 177 - 190
[29] Attainability for Markov and Semi-Markov Chains
Verbeken, Brecht
Guerry, Marie-Anne
MATHEMATICS, 2024, 12 (08)
[30] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
Huang, Yong-hui
Guo, Xian-ping
ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (02) : 177 - 190
