Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games

被引:0
|
作者
Markus N. Rabe
Sven Schewe
机构
[1] Universität des Saarlandes,
[2] University of Liverpool,undefined
来源
Acta Informatica | 2011年 / 48卷
关键词
Markov Decision Process; Switching Point; Discrete Location; Goal Region; Continuous Location;
D O I
暂无
中图分类号
学科分类号
摘要
We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-time Markov games. Furthermore, we show that optimal control does not only exist, but has a surprisingly simple structure: the optimal schedulers from our proofs are deterministic and timed positional, and the bounded time can be divided into a finite number of intervals, in which the optimal strategies are positional. That is, we demonstrate the existence of finite optimal control. Finally, we show that these pleasant properties of Markov decision processes extend to the more general class of continuous-time Markov games, and that both early and late schedulers show this behaviour.
引用
收藏
相关论文
共 50 条
  • [31] Time-Bounded Reachability in Distributed Input/Output Interactive Probabilistic Chains
    Cahn, Georgel
    Crouzen, Pepijn
    D'Argenio, Pedro R.
    Hahn, E. Moritz
    Zhang, Lijun
    MODEL CHECKING SOFTWARE, 2010, 6349 : 193 - +
  • [32] PASSAGE-TIME GENERATING FUNCTIONS FOR CONTINUOUS-TIME FINITE MARKOV CHAINS
    DARROCH, JN
    MORRIS, KW
    JOURNAL OF APPLIED PROBABILITY, 1968, 5 (02) : 414 - &
  • [33] Γ-Finite-time stabilization of continuous-time systems with optimal performance
    Jin, Nana
    Xu, Juanjuan
    Zhang, Huanshui
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2021, 42 (02): : 590 - 602
  • [34] FINITE-SETTLING-TIME CONTROL OF CONTINUOUS-TIME PLANTS
    ICHIKAWA, K
    SYSTEMS & CONTROL LETTERS, 1987, 9 (04) : 341 - 343
  • [35] Continuous-time stochastic games
    Neyman, Abraham
    GAMES AND ECONOMIC BEHAVIOR, 2017, 104 : 92 - 130
  • [36] Continuous-time games of timing
    Laraki, R
    Solan, E
    Vieille, N
    JOURNAL OF ECONOMIC THEORY, 2005, 120 (02) : 206 - 238
  • [37] Reputation in Continuous-Time Games
    Faingold, Eduardo
    Sannikov, Yuliy
    ECONOMETRICA, 2011, 79 (03) : 773 - 876
  • [38] CONTINUOUS-TIME REPEATED GAMES
    BERGIN, J
    MACLEOD, WB
    INTERNATIONAL ECONOMIC REVIEW, 1993, 34 (01) : 21 - 37
  • [39] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
    Xianping Guo
    Yonghui Huang
    Yi Zhang
    Applied Mathematics & Optimization, 2017, 75 : 317 - 341
  • [40] SEQUENTIAL ESTIMATION FOR CONTINUOUS-TIME FINITE MARKOV-PROCESSES
    ADKE, SR
    MANJUNATH, SM
    STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 1984, 18 (02) : 227 - 227