Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes

被引：0

作者：

Baier, C ^{[1
]}

Haverkort, B

Hermanns, H

Katoen, JP

机构：

[1] Univ Bonn, Inst Informat I, Bonn, Germany

[2] Univ Twente, Fac Elect Engn Math & Comp Sci, Enschede, Netherlands

[3] Univ Saarland, Dept Comp Sci, Homburg, Germany

来源：

TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS | 2004年 / 2988卷

关键词：

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

A continuous-time Markov decision process (CTMDP) is a generalization of a continuous-time Markov chain in which both probabilistic and nondeterministic choices co-exist. This paper presents an efficient algorithm to compute the maximum (or minimum) probability to reach a set of goal states within a given time bound in a uniform CTMDP, i.e., a CTMDP in which the delay time distribution per state visit is the same for all states. We prove that these probabilities coincide for (time-abstract) history-dependent and Markovian schedulers that resolve nondeterminism either deterministically or in a randomized way.

引用

页码：61 / 76

页数：16

共 50 条

[31] Formal Synthesis of Control Policies for Continuous Time Markov Processes From Time-Bounded Temporal Logic Specifications
Ayala, Ana Medina
Andersson, Sean B.
Belta, Calin
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (09) : 2568 - 2573
[32] Efficient maximum likelihood parameterization of continuous-time Markov processes
McGibbon, Robert T.
Pande, Vijay S.
[J]. JOURNAL OF CHEMICAL PHYSICS, 2015, 143 (03):
[33] CONTINUOUS-TIME MARKOV DECISION-PROCESSES (CTMDP) WITH NON-UNIFORMLY BOUNDED TRANSITION RATES
SONG, JS
[J]. SCIENTIA SINICA SERIES A-MATHEMATICAL PHYSICAL ASTRONOMICAL & TECHNICAL SCIENCES, 1988, 31 (11): : 1281 - 1291
[34] On continuous-time Markov processes in bargaining
Houba, Harold
[J]. ECONOMICS LETTERS, 2008, 100 (02) : 280 - 283
[35] Kolmogorov equations in fractional derivatives for the transition probabilities of continuous-time Markov processes
Miroshin R.N.
[J]. Vestnik St. Petersburg University, Mathematics, 2017, 50 (1) : 24 - 31
[36] Policy learning in continuous-time Markov decision processes using Gaussian Processes
Bartocci, Ezio
Bortolussi, Luca
Brazdil, Tomas
Milios, Dimitrios
Sanguinetti, Guido
[J]. PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
[37] Approximate Parameter Synthesis for Probabilistic Time-Bounded Reachability
Han, Tingting
Katoen, Joost-Pieter
Mereacre, Alexandru
[J]. RTSS: 2008 REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2008, : 173 - 182
[38] Discounted optimality for continuous-time Markov decision processes in Polish spaces
Guo, Xianping
[J]. 2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 1785 - 1787
[39] Average optimality for continuous-time Markov decision processes in Polish spaces
Guo, Xianping
Rieder, Ulrich
[J]. ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
[40] The risk probability criterion for discounted continuous-time Markov decision processes
Huo, Haifeng
Zou, Xiaolong
Guo, Xianping
[J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699

← 1 2 3 4 5 →