Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Cited by: 14
Author
Cavazos-Cadena, Rolando [1]
Affiliation
[1] Univ Autonoma Agr Antonio Narro, Saltillo 25315, Coah, Mexico
Keywords
First arrival time; Stopping problem with total cost index; Relative value function; Constant average cost; Stochastic matrix associated with a multiplicative Poisson equation; OPTIMAL STATIONARY POLICIES; CRITERION; EXISTENCE
DOI
10.1007/s00186-009-0285-6
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Discipline classification code
070105; 12; 1201; 1202; 120202
Abstract
This note concerns controlled Markov chains on a denumerable state space. The performance of a control policy is measured by the risk-sensitive average criterion, and it is assumed that (a) the simultaneous Doeblin condition holds, and (b) the system is communicating under the action of each stationary policy. If the cost function is bounded below, it is established that the optimal average cost is characterized by an optimality inequality, and it is shown that, even for bounded costs, such an inequality may be strict at every state. Also, for a nonnegative cost function with compact support, the existence and uniqueness of bounded solutions of the optimality equation are proved, and an example is provided to show that such a conclusion generally fails when the cost is negative at some state.
Pages: 47-84
Number of pages: 38
Related papers
50 in total
  • [21] Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion
    Selene Chávez-Rodríguez
    Rolando Cavazos-Cadena
    Hugo Cruz-Suárez
    Journal of Optimization Theory and Applications, 2016, 170 : 670 - 686
  • [22] A poisson equation for the risk-sensitive average cost in semi-markov chains
    Cavazos-Cadena, Rolando
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2016, 26 (04): : 633 - 656
  • [23] Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach
    Rolando Cavazos-Cadena
    Daniel Hernández-Hernández
    Mathematical Methods of Operations Research, 2003, 56 : 473 - 479
  • [24] Solution to the risk-sensitive average optimality equation in communicating Markov decision chains with finite state space: An alternative approach
    Cavazos-Cadena, R
    Hernández-Hernández, D
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 56 (03) : 473 - 479
  • [25] Risk-sensitive and Mean Variance Optimality in Continuous-time Markov Decision Chains
    Sladky, Karel
    MATHEMATICAL METHODS IN ECONOMICS (MME 2018), 2018, : 497 - 502
  • [26] CENTRAL MOMENTS AND RISK-SENSITIVE OPTIMALITY IN MARKOV REWARD CHAINS
    Sladky, Karel
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE: QUANTITATIVE METHODS IN ECONOMICS: MULTIPLE CRITERIA DECISION MAKING XIX, 2018, : 325 - 331
  • [27] Discounted approximations in risk-sensitive average Markov cost chains with finite state space
    Rubén Blancas-Rivera
    Rolando Cavazos-Cadena
    Hugo Cruz-Suárez
    Mathematical Methods of Operations Research, 2020, 91 : 241 - 268
  • [28] Discounted approximations in risk-sensitive average Markov cost chains with finite state space
    Blancas-Rivera, Ruben
    Cavazos-Cadena, Rolando
    Cruz-Suarez, Hugo
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2020, 91 (02) : 241 - 268
  • [29] Risk-Sensitive and Mean Variance Optimality in Markov Decision Processes
    Sladky, Karel
    Sitar, Milan
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON MATHEMATICAL METHODS IN ECONOMICS 2008, 2008, : 451 - 459
  • [30] Cumulative Optimality in Risk-Sensitive and Risk-Neutral Markov Reward Chains
    Sladky, Karel
    MATHEMATICAL METHODS IN ECONOMICS 2013, PTS I AND II, 2013, : 814 - 819