Optimal threshold probability and expectation in semi-Markov decision processes

被引:10
|
作者
Sakaguchi, Masahiko [1 ]
Ohtsubo, Yoshio [1 ]
机构
[1] Kochi Univ, Fac Sci, Dept Math, Kochi 7808520, Japan
关键词
Semi-Markov decision process; Optimal threshold probability; Existence of optimal policy; Value iteration; Policy improvement method; Stochastic order; MINIMIZING RISK MODELS;
D O I
10.1016/j.amc.2010.04.007
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider undiscounted semi-Markov decision process with a target set and our main concern is a problem minimizing threshold probability. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists a stationary optimal policy. Also several value iteration methods and a policy improvement method are given in our model. Furthermore, we investigate a relationship between threshold probabilities and expectations for total rewards. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:2947 / 2958
页数:12
相关论文
共 50 条
  • [21] OPTIMIZATION OF DENUMERABLE SEMI-MARKOV DECISION PROCESSES.
    Staniewski, Piotr
    Weinfeld, Roman
    [J]. Systems Science, 1980, 6 (02): : 129 - 141
  • [22] Semi-Markov decision processes with variance minimization criterion
    Qingda Wei
    Xianping Guo
    [J]. 4OR, 2015, 13 : 59 - 79
  • [23] On undiscounted semi-Markov decision processes with absorbing states
    Prasenjit Mondal
    [J]. Mathematical Methods of Operations Research, 2016, 83 : 161 - 177
  • [24] Constrained semi-markov decision processes with average rewards
    Feinberg, E.A.
    [J]. ZOR. Zeitschrift Fuer Operations Research, 1994, 40 (03):
  • [25] On undiscounted semi-Markov decision processes with absorbing states
    Mondal, Prasenjit
    [J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2016, 83 (02) : 161 - 177
  • [26] A Hemimetric Extension of Simulation for Semi-Markov Decision Processes
    Pedersen, Mathias Ruggaard
    Bacci, Giorgio
    Larsen, Kim Guldstrand
    Mardare, Radu
    [J]. QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2018, 2018, 11024 : 339 - 355
  • [27] SEMI-MARKOV DECISION-PROCESSES WITH POLYNOMIAL REWARD
    ROSBERG, Z
    [J]. JOURNAL OF APPLIED PROBABILITY, 1982, 19 (02) : 301 - 309
  • [28] Risk-aware semi-Markov decision processes
    Isohaetaelae, Jukka
    Haskell, William B.
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [29] Optimal stopping time on discounted semi-Markov processes
    Chen, Fang
    Guo, Xianping
    Liao, Zhong-Wei
    [J]. FRONTIERS OF MATHEMATICS IN CHINA, 2021, 16 (02) : 303 - 324
  • [30] Optimal maintenance of deteriorating equipment using semi-Markov decision processes and linear programming
    Kechagias, G. A.
    Diamantidis, A. C.
    Dimitrakos, T. D.
    Tsakalerou, M.
    [J]. INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING AND MANAGEMENT, 2024, 15 (01): : 81 - 95