A characterization of the optimal risk-sensitive average cost in finite controlled Markov chains

被引:29
|
作者
Cavazos-Cadena, R [1 ]
Hernández-Hernández, D
机构
[1] Univ Autonoma Agraria Antonio Narro, Dept Estadist & Calculo, Saltillo 25315, Coahuila, Mexico
[2] Ctr Invest Matemat, Guanajuato 36000, GTO, Mexico
来源
ANNALS OF APPLIED PROBABILITY | 2005年 / 15卷 / 1A期
关键词
decreasing function along trajectories; stopping time; nearly optimal policies; Holder's inequality; simultaneous Doeblin condition; recurrent state;
D O I
10.1214/105051604000000585
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This work concerns controlled Markov chains with finite state and action spaces. The transition law satisfies the simultaneous Doeblin condition, and the performance of a control policy is measured by the (long-run) risk-sensitive average cost criterion associated to a positive, but otherwise arbitrary, risk sensitivity coefficient. Within this context, the optimal risk-sensitive average cost is characterized via a minimization problem in a finite-dimensional Euclidean space.
引用
收藏
页码:175 / 212
页数:38
相关论文
共 50 条
  • [1] Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains
    Cavazos-Cadena, Rolando
    MATHEMATICS OF OPERATIONS RESEARCH, 2018, 43 (03) : 1025 - 1050
  • [2] Discounted approximations to the risk-sensitive average cost in finite Markov chains
    Cavazos-Cadena, Rolando
    Cruz-Suarez, Daniel
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2017, 450 (02) : 1345 - 1362
  • [3] Controlled Markov chains with risk-sensitive exponential average cost criterion
    Brau, A
    Fernandez-Gaucherand, E
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2260 - 2264
  • [4] Controlled Markov chains with risk-sensitive criteria: Average cost, optimality equations, and optimal solutions
    Rolando Cavazos-Cadena
    Emmanuel Fernández-Gaucherand
    Mathematical Methods of Operations Research, 1999, 49 (2) : 299 - 324
  • [5] Controlled Markov chains with risk-sensitive criteria:: Average cost, optimality equations, and optimal solutions
    Cavazos-Cadena, R
    Fernández-Gaucherand, E
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 1999, 49 (02) : 299 - 324
  • [6] Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
    Chavez-Rodriguez, Selene
    Cavazos-Cadena, Rolando
    Cruz-Suarez, Hugo
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 170 (02) : 670 - 686
  • [7] Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
    Selene Chávez-Rodríguez
    Rolando Cavazos-Cadena
    Hugo Cruz-Suárez
    Journal of Optimization Theory and Applications, 2016, 170 : 670 - 686
  • [8] Discounted approximations in risk-sensitive average Markov cost chains with finite state space
    Rubén Blancas-Rivera
    Rolando Cavazos-Cadena
    Hugo Cruz-Suárez
    Mathematical Methods of Operations Research, 2020, 91 : 241 - 268
  • [9] Discounted approximations in risk-sensitive average Markov cost chains with finite state space
    Blancas-Rivera, Ruben
    Cavazos-Cadena, Rolando
    Cruz-Suarez, Hugo
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2020, 91 (02) : 241 - 268
  • [10] A poisson equation for the risk-sensitive average cost in semi-markov chains
    Rolando Cavazos-Cadena
    Discrete Event Dynamic Systems, 2016, 26 : 633 - 656