Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games

被引：0

作者：

Markus N. Rabe

Sven Schewe

机构：

[1] Universität des Saarlandes,

[2] University of Liverpool,undefined

来源：

Acta Informatica | 2011年 / 48卷

关键词：

Markov Decision Process; Switching Point; Discrete Location; Goal Region; Continuous Location;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-time Markov games. Furthermore, we show that optimal control does not only exist, but has a surprisingly simple structure: the optimal schedulers from our proofs are deterministic and timed positional, and the bounded time can be divided into a finite number of intervals, in which the optimal strategies are positional. That is, we demonstrate the existence of finite optimal control. Finally, we show that these pleasant properties of Markov decision processes extend to the more general class of continuous-time Markov games, and that both early and late schedulers show this behaviour.

引用

共 50 条

[41] Impulsive control for continuous-time Markov decision processes
Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex
33405, France
不详
L69 7ZL, United Kingdom
Adv Appl Probab, 1 (106-127):
[42] Control of continuous-time Markov chains with safety constraints
Hsu, Shun-Pin
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (05) : 492 - 503
[43] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
Dufour, Francois
Piunovskiy, Alexei B.
ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
[44] On certain average characteristics of finite continuous-time Markov chains
Satin Y.A.
Zeifman A.I.
Journal of Mathematical Sciences, 2015, 205 (1) : 100 - 104
[45] NOTE ON FINITE HOMOGENEOUS CONTINUOUS-TIME MARKOV-CHAINS
TAVARE, S
BIOMETRICS, 1979, 35 (04) : 831 - 834
[46] SEQUENTIAL ESTIMATION FOR CONTINUOUS-TIME FINITE MARKOV-PROCESSES
ADKE, SR
MANJUNATH, SM
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1984, 13 (09) : 1089 - 1106
[47] Stability estimates for finite homogeneous continuous-time Markov chains
Mitrophanov, A. Yu.
THEORY OF PROBABILITY AND ITS APPLICATIONS, 2006, 50 (02) : 319 - 326
[48] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Guo, Xianping
Huang, Yonghui
Zhang, Yi
APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02): : 317 - 341
[49] Optimal consumption and insurance: A continuous-time Markov chain approach
Kraft, Holger
Steffensen, Mogens
ASTIN BULLETIN, 2008, 38 (01): : 231 - 257
[50] Relatively optimal control for continuous-time systems
Blanchini, Franco
Fujisaki, Yasumasa
PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5649 - +

← 1 2 3 4 5 →