FIRST PASSAGE RISK PROBABILITY OPTIMALITY FOR CONTINUOUS TIME MARKOV DECISION PROCESSES

被引:4
|
作者
Huo, Haifeng [1 ]
Wen, Xian [1 ,2 ]
机构
[1] Guangxi Univ Sci & Technol, Sch Sci, Liuzhou 545006, Peoples R China
[2] Guangxi Univ Sci & Technol, Lushan Coll, Liuzhou 5450616, Peoples R China
基金
中国国家自然科学基金;
关键词
continuous time Markov decision processes; first passage time; risk probability criterion; optimal policy; MODELS; MINIMIZATION; CRITERION;
D O I
10.14736/kyb-2019-1-0114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerable state space, a Borel action space, unbounded transition rates and nonnegative reward function. The optimality criterion to be considered is the first passage risk probability criterion. To ensure the non-explosion of the state processes, we first introduce a so-called drift condition, which is weaker than the well known regular condition for semi-Markov decision processes (SMDPs). Furthermore, under some suitable conditions, by value iteration recursive approximation technique, we establish the optimality equation, obtain the uniqueness of the value function and the existence of optimal policies. Finally, two examples are used to illustrate our results.
引用
收藏
页码:114 / 133
页数:20
相关论文
共 50 条
  • [1] ON THE FIRST PASSAGE g-MEAN-VARIANCE OPTIMALITY FOR DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES
    Guo, Xianping
    Huang, Xiangxiang
    Zhang, Yi
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (03) : 1406 - 1424
  • [2] First Passage Risk Probability Minimization for Piecewise Deterministic Markov Decision Processes
    Wen, Xin
    Huo, Hai-feng
    Guo, Xian-ping
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2022, 38 (03): : 549 - 567
  • [3] First Passage Risk Probability Minimization for Piecewise Deterministic Markov Decision Processes
    Xin Wen
    Hai-feng Huo
    Xian-ping Guo
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2022, 38 : 549 - 567
  • [4] Optimal risk probability for first passage models in semi-Markov decision processes
    Huang, Yonghui
    Guo, Xianping
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2009, 359 (01) : 404 - 420
  • [5] First Passage Optimality for Continuous-Time Markov Decision Processes With Varying Discount Factors and History-Dependent Policies
    Guo, Xianping
    Song, Xinyuan
    Zhang, Yi
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (01) : 163 - 174
  • [6] MARKOV DECISION PROCESSES WITH A NEW OPTIMALITY CRITERION - CONTINUOUS TIME
    JAQUETTE, SC
    [J]. ANNALS OF STATISTICS, 1975, 3 (02): : 547 - 553
  • [7] The risk probability criterion for discounted continuous-time Markov decision processes
    Huo, Haifeng
    Zou, Xiaolong
    Guo, Xianping
    [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699
  • [8] The risk probability criterion for discounted continuous-time Markov decision processes
    Haifeng Huo
    Xiaolong Zou
    Xianping Guo
    [J]. Discrete Event Dynamic Systems, 2017, 27 : 675 - 699
  • [9] Bias optimality for multichain continuous-time Markov decision processes
    Guo, Xianping
    Song, XinYuan
    Zhang, Junyu
    [J]. OPERATIONS RESEARCH LETTERS, 2009, 37 (05) : 317 - 321
  • [10] Continuous-Time Markov Decision Processes Under the Risk-Sensitive First Passage Discounted Cost Criterion
    Qingda Wei
    Xian Chen
    [J]. Journal of Optimization Theory and Applications, 2023, 197 : 309 - 333