Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes

被引:0
|
作者
Baier, C [1 ]
Haverkort, B
Hermanns, H
Katoen, JP
机构
[1] Univ Bonn, Inst Informat I, Bonn, Germany
[2] Univ Twente, Fac Elect Engn Math & Comp Sci, Enschede, Netherlands
[3] Univ Saarland, Dept Comp Sci, Homburg, Germany
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A continuous-time Markov decision process (CTMDP) is a generalization of a continuous-time Markov chain in which both probabilistic and nondeterministic choices co-exist. This paper presents an efficient algorithm to compute the maximum (or minimum) probability to reach a set of goal states within a given time bound in a uniform CTMDP, i.e., a CTMDP in which the delay time distribution per state visit is the same for all states. We prove that these probabilities coincide for (time-abstract) history-dependent and Markovian schedulers that resolve nondeterminism either deterministically or in a randomized way.
引用
收藏
页码:61 / 76
页数:16
相关论文
共 50 条
  • [31] Formal Synthesis of Control Policies for Continuous Time Markov Processes From Time-Bounded Temporal Logic Specifications
    Ayala, Ana Medina
    Andersson, Sean B.
    Belta, Calin
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (09) : 2568 - 2573
  • [32] Efficient maximum likelihood parameterization of continuous-time Markov processes
    McGibbon, Robert T.
    Pande, Vijay S.
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2015, 143 (03):
  • [33] CONTINUOUS-TIME MARKOV DECISION-PROCESSES (CTMDP) WITH NON-UNIFORMLY BOUNDED TRANSITION RATES
    SONG, JS
    [J]. SCIENTIA SINICA SERIES A-MATHEMATICAL PHYSICAL ASTRONOMICAL & TECHNICAL SCIENCES, 1988, 31 (11): : 1281 - 1291
  • [34] On continuous-time Markov processes in bargaining
    Houba, Harold
    [J]. ECONOMICS LETTERS, 2008, 100 (02) : 280 - 283
  • [35] Kolmogorov equations in fractional derivatives for the transition probabilities of continuous-time Markov processes
    Miroshin R.N.
    [J]. Vestnik St. Petersburg University, Mathematics, 2017, 50 (1) : 24 - 31
  • [36] Policy learning in continuous-time Markov decision processes using Gaussian Processes
    Bartocci, Ezio
    Bortolussi, Luca
    Brazdil, Tomas
    Milios, Dimitrios
    Sanguinetti, Guido
    [J]. PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
  • [37] Approximate Parameter Synthesis for Probabilistic Time-Bounded Reachability
    Han, Tingting
    Katoen, Joost-Pieter
    Mereacre, Alexandru
    [J]. RTSS: 2008 REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2008, : 173 - 182
  • [38] Discounted optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    [J]. 2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 1785 - 1787
  • [39] Average optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    Rieder, Ulrich
    [J]. ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
  • [40] The risk probability criterion for discounted continuous-time Markov decision processes
    Huo, Haifeng
    Zou, Xiaolong
    Guo, Xianping
    [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699