Average criteria in denumerable semi-Markov decision chains under risk-aversion

被引:0
|
作者
Cavazos-Cadena, Rolando [1 ]
Cruz-Suarez, Hugo [2 ]
Montes-De-Oca, Raul [3 ]
机构
[1] Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Blvd Antonio Narro 1923, Saltillo 25315, Coah, Mexico
[2] Benemerita Univ Autonoma Puebla, Fac Ciencias Fisicomatemat, Ave San Claudio & Rio Verde, Puebla 72570, Pue, Mexico
[3] Univ Autonoma Metropolitana Iztapalapa, Dept Matemat, Ave Ferrocaril San Rafael Atlixco 186,Col Leyes Re, Cdmx 09310, Mexico
关键词
Exponential utility function; Certainty equivalent; Total relative cost; Verification theorem; Cost structure with bounded support; INFINITE-HORIZON RISK; SENSITIVE CONTROL; OPTIMALITY; COST; SYSTEM;
D O I
10.1007/s10626-023-00376-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This note concerns with semi-Markov decision chains evolving on a denumerable state space. The system is directed by a risk-averse controller with constant risk-sensitivity, and the performance of a decision policy is measured by a long-run average criterion associated with bounded holding cost rates and one-step cost function. Under mild conditions on the sojourn times and the transition law, restrictions on the cost structure are given to ensure that the optimal average cost can be characterized via a bounded solution of the optimality equation. Such a result is used to establish a general characterization of the optimal average cost in terms of an optimality inequality from which an optimal stationary policy can be derived.
引用
收藏
页码:221 / 256
页数:36
相关论文
共 50 条