Average criteria in denumerable semi-Markov decision chains under risk-aversion

被引:0
|
作者
Cavazos-Cadena, Rolando [1 ]
Cruz-Suarez, Hugo [2 ]
Montes-De-Oca, Raul [3 ]
机构
[1] Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Blvd Antonio Narro 1923, Saltillo 25315, Coah, Mexico
[2] Benemerita Univ Autonoma Puebla, Fac Ciencias Fisicomatemat, Ave San Claudio & Rio Verde, Puebla 72570, Pue, Mexico
[3] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave Ferrocaril San Rafael Atlixco 186,Col Leyes Re, Cdmx 09310, Mexico
关键词
Exponential utility function; Certainty equivalent; Total relative cost; Verification theorem; Cost structure with bounded support; INFINITE-HORIZON RISK; SENSITIVE CONTROL; OPTIMALITY; COST; SYSTEM;
D O I
10.1007/s10626-023-00376-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This note concerns with semi-Markov decision chains evolving on a denumerable state space. The system is directed by a risk-averse controller with constant risk-sensitivity, and the performance of a decision policy is measured by a long-run average criterion associated with bounded holding cost rates and one-step cost function. Under mild conditions on the sojourn times and the transition law, restrictions on the cost structure are given to ensure that the optimal average cost can be characterized via a bounded solution of the optimality equation. Such a result is used to establish a general characterization of the optimal average cost in terms of an optimality inequality from which an optimal stationary policy can be derived.
引用
收藏
页码:221 / 256
页数:36
相关论文
共 50 条
  • [21] Semi-markov decision processes nonstandard criteria
    Baykal-Guersoy, M.
    Guersoy, K.
    PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2007, 21 (04) : 635 - 657
  • [22] Finite horizon partially observable semi-Markov decision processes under risk probability criteria
    Wen, Xin
    Guo, Xianping
    Xia, Li
    Operations Research Letters, 2024, 57
  • [23] Constrained semi-markov decision processes with average rewards
    Feinberg, E.A.
    ZOR. Zeitschrift Fuer Operations Research, 1994, 40 (03):
  • [24] RECURRENCE CONDITIONS FOR AVERAGE AND BLACKWELL OPTIMALITY IN DENUMERABLE STATE MARKOV DECISION CHAINS
    DEKKER, R
    HORDIJK, A
    MATHEMATICS OF OPERATIONS RESEARCH, 1992, 17 (02) : 271 - 289
  • [25] Using Semi-Markov Chains to Solve Semi-Markov Processes
    Wu, Bei
    Maya, Brenda Ivette Garcia
    Limnios, Nikolaos
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2021, 23 (04) : 1419 - 1431
  • [26] Using Semi-Markov Chains to Solve Semi-Markov Processes
    Bei Wu
    Brenda Ivette Garcia Maya
    Nikolaos Limnios
    Methodology and Computing in Applied Probability, 2021, 23 : 1419 - 1431
  • [27] Constrained semi-Markov decision processes with ratio and time expected average criteria in Polish spaces
    Wei, Qingda
    Guo, Xianping
    OPTIMIZATION, 2015, 64 (07) : 1593 - 1623
  • [28] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
    Yong-hui Huang
    Guo Xian-ping
    Acta Mathematicae Applicatae Sinica, English Series, 2011, 27 : 177 - 190
  • [29] Attainability for Markov and Semi-Markov Chains
    Verbeken, Brecht
    Guerry, Marie-Anne
    MATHEMATICS, 2024, 12 (08)
  • [30] First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
    Huang, Yong-hui
    Guo, Xian-ping
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2011, 27 (02): : 177 - 190