Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games

被引：0

作者：

Markus N. Rabe

Sven Schewe

机构：

[1] Universität des Saarlandes,

[2] University of Liverpool,undefined

来源：

Acta Informatica | 2011年 / 48卷

关键词：

Markov Decision Process; Switching Point; Discrete Location; Goal Region; Continuous Location;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-time Markov games. Furthermore, we show that optimal control does not only exist, but has a surprisingly simple structure: the optimal schedulers from our proofs are deterministic and timed positional, and the bounded time can be divided into a finite number of intervals, in which the optimal strategies are positional. That is, we demonstrate the existence of finite optimal control. Finally, we show that these pleasant properties of Markov decision processes extend to the more general class of continuous-time Markov games, and that both early and late schedulers show this behaviour.

引用

共 50 条

[1] Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games
Rabe, Markus N.
Schewe, Sven
ACTA INFORMATICA, 2011, 48 (5-6) : 291 - 315
[2] Continuous-time stochastic games with time-bounded reachability
Brazdil, Tomas
Forejt, Vojtech
Krcal, Jan
Kretinsky, Jan
Kucera, Antonin
INFORMATION AND COMPUTATION, 2013, 224 : 46 - 70
[3] Optimal time-abstract schedulers for CTMDPs and continuous-time Markov games
Rabe, Markus N.
Schewe, Sven
THEORETICAL COMPUTER SCIENCE, 2013, 467 : 53 - 67
[4] A Lyapunov Approach for Time-Bounded Reachability of CTMCs and CTMDPs
Salamati, Mahmoud
Soudjani, Sadegh
Majumdar, Rupak
ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS, 2020, 5 (01)
[5] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
Baier, C
Hermanns, H
Katoen, JP
Haverkort, BR
THEORETICAL COMPUTER SCIENCE, 2005, 345 (01) : 2 - 26
[6] Efficient computation of time-bounded reachability probabilities in uniform continuous-time Markov decision processes
Baier, C
Haverkort, B
Hermanns, H
Katoen, JP
TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2004, 2988 : 61 - 76
[7] Efficient approximation of optimal control for continuous-time Markov games
Fearnley, John
Rabe, Markus N.
Schewe, Sven
Zhang, Lijun
INFORMATION AND COMPUTATION, 2016, 247 : 106 - 129
[8] Efficient Approximation of Optimal Control for Continuous-Time Markov Games
Fearnley, John
Rabe, Markus
Schewe, Sven
Zhang, Lijun
IARCS ANNUAL CONFERENCE ON FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE (FSTTCS 2011), 2011, 13 : 399 - 410
[9] Policy Learning for Time-Bounded Reachability in Continuous-Time Markov Decision Processes via Doubly-Stochastic Gradient Ascent
Bartocci, Ezio
Bortolussi, Luca
Brazdil, Tomas
Milios, Dimitrios
Sanguinetti, Guido
QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2016, 2016, 9826 : 244 - 259
[10] Time-Bounded Model Checking of Infinite-State Continuous-Time Markov Chains
Hahn, E. Moritz
Hermanns, Holger
Wachter, Bjoern
Zhang, Lijun
FUNDAMENTA INFORMATICAE, 2009, 95 (01) : 129 - 155

← 1 2 3 4 5 →