Risk-Sensitive and Average Optimality in Markov Decision Processes

被引:0
|
作者
Sladky, Karel [1 ]
机构
[1] Acad Sci Czech Republic, Inst Informat Theory & Automat, CR-18208 Prague 8, Czech Republic
关键词
dynamic programming; stochastic models; risk analysis and management; DYNAMIC-PROGRAMMING RECURSIONS; CHAINS;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Markov chain is evaluated by an exponential utility function. We restrict ourselves on irreducible or unichain Markov models where risk-sensitive average optimality is independent of the starting state. As we show this problem is closely related to solution of (nonlinear) Poissonian equations and their connections with nonnegative matrices.
引用
收藏
页码:799 / 804
页数:6
相关论文
共 50 条
  • [41] Risk-sensitive and Mean Variance Optimality in Continuous-time Markov Decision Chains
    Sladky, Karel
    [J]. MATHEMATICAL METHODS IN ECONOMICS (MME 2018), 2018, : 497 - 502
  • [42] Average optimality for risk-sensitive control with general state space
    Jaskiewicz, Anna
    [J]. ANNALS OF APPLIED PROBABILITY, 2007, 17 (02): : 654 - 675
  • [43] CENTRAL MOMENTS AND RISK-SENSITIVE OPTIMALITY IN MARKOV REWARD CHAINS
    Sladky, Karel
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE: QUANTITATIVE METHODS IN ECONOMICS: MULTIPLE CRITERIA DECISION MAKING XIX, 2018, : 325 - 331
  • [44] Risk-sensitive finite-horizon piecewise deterministic Markov decision processes
    Huang, Yonghui
    Lian, Zhaotong
    Guo, Xianping
    [J]. OPERATIONS RESEARCH LETTERS, 2020, 48 (01) : 96 - 103
  • [45] Risk-sensitive Markov decision processes with long-run CVaR criterion
    Xia, Li
    Zhang, Luyao
    Glynn, Peter W.
    [J]. PRODUCTION AND OPERATIONS MANAGEMENT, 2023, 32 (12) : 4049 - 4067
  • [46] Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes
    O. L. V. Costa
    F. Dufour
    [J]. Mathematical Methods of Operations Research, 2021, 93 : 327 - 357
  • [47] Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes
    Costa, O. L. V.
    Dufour, F.
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2021, 93 (02) : 327 - 357
  • [48] Average Optimality in Nonhomogeneous Infinite Horizon Markov Decision Processes
    Wachs, Allise O.
    Schochetman, Irwin E.
    Smith, Robert L.
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2011, 36 (01) : 147 - 164
  • [49] AVERAGE COST MARKOV DECISION-PROCESSES - OPTIMALITY CONDITIONS
    HERNANDEZLERMA, O
    HENNET, JC
    LASSERRE, JB
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1991, 158 (02) : 396 - 406
  • [50] The value iteration algorithm in risk-sensitive average Markov decision chains with finite state space
    Cavazos-Cadena, R
    Montes-de-Oca, R
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2003, 28 (04) : 752 - 776