An Approximation Approach for the Deviation Matrix of Continuous-Time Markov Processes with Application to Markov Decision Theory

被引:3
|
作者
Leder, Nicole [1 ]
Heidergott, Bernd [2 ,3 ]
Hordijk, Arie [4 ]
机构
[1] Univ Hamburg, Dept Math, D-20146 Hamburg, Germany
[2] Vrije Univ Amsterdam, Dept Econometr & Operat Res, NL-1081 HV Amsterdam, Netherlands
[3] Vrije Univ Amsterdam, Tinbergen Inst, NL-1081 HV Amsterdam, Netherlands
[4] Leiden Univ, Math Inst, NL-2300 RA Leiden, Netherlands
关键词
BLACKWELL OPTIMALITY; OPTIMAL POLICIES; STATE-SPACE; AVERAGE; CHAINS; RECURRENCE; ADMISSION; GAMES;
D O I
10.1287/opre.1090.0786
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
We present an update formula that allows the expression of the deviation matrix of a continuous-time Markov process with denumerable state space having generator matrix Q* through a continuous-time Markov process with generator matrix Q. We show that under suitable stability conditions the algorithm converges at a geometric rate. By applying the concept to three different examples, namely, the M/M/1 queue with vacations, the M/G/1 queue, and a tandem network, we illustrate the broad applicability of our approach. For a problem in admission control, we apply our approximation algorithm to Markov decision theory for computing the optimal control policy. Numerical examples are presented to highlight the efficiency of the proposed algorithm.
引用
收藏
页码:918 / 932
页数:15
相关论文
共 50 条
  • [31] Sufficiency of Markov Policies for Continuous-Time Markov Decision Processes and Solutions to Kolmogorov's Forward Equation for Jump Markov Processes
    Feinberg, Eugene A.
    Mandava, Manasa
    Shiryaev, Albert N.
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 5728 - 5732
  • [32] Policy learning in continuous-time Markov decision processes using Gaussian Processes
    Bartocci, Ezio
    Bortolussi, Luca
    Brazdil, Tomas
    Milios, Dimitrios
    Sanguinetti, Guido
    PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
  • [33] Discounted optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 1785 - 1787
  • [34] Average optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    Rieder, Ulrich
    ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
  • [35] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
    Guo, Xianping
    Vykertas, Mantas
    Zhang, Yi
    ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
  • [36] Denumerable continuous-time Markov decision processes with multiconstraints on average costs
    Liu, Qiuli
    Tan, Hangsheng
    Guo, Xianping
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) : 576 - 585
  • [37] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
    Guo, Xianping
    Song, Xinyuan
    ANNALS OF APPLIED PROBABILITY, 2011, 21 (05): : 2016 - 2049
  • [39] A survey of recent results on continuous-time Markov decision processes - Discussion
    Hu, Qiying
    TOP, 2006, 14 (02) : 248 - 251
  • [40] The risk probability criterion for discounted continuous-time Markov decision processes
    Huo, Haifeng
    Zou, Xiaolong
    Guo, Xianping
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2017, 27 (04): : 675 - 699