An Approximation Approach for the Deviation Matrix of Continuous-Time Markov Processes with Application to Markov Decision Theory

Cited by: 3

Authors:

Leder, Nicole [1]

Heidergott, Bernd [2, 3]

Hordijk, Arie [4]

Affiliations:

[1] Univ Hamburg, Dept Math, D-20146 Hamburg, Germany

[2] Vrije Univ Amsterdam, Dept Econometr & Operat Res, NL-1081 HV Amsterdam, Netherlands

[3] Vrije Univ Amsterdam, Tinbergen Inst, NL-1081 HV Amsterdam, Netherlands

[4] Leiden Univ, Math Inst, NL-2300 RA Leiden, Netherlands

Source:

OPERATIONS RESEARCH | 2010 / Vol. 58 / Issue 04

Keywords:

BLACKWELL OPTIMALITY; OPTIMAL POLICIES; STATE-SPACE; AVERAGE; CHAINS; RECURRENCE; ADMISSION; GAMES;

DOI:

10.1287/opre.1090.0786

Chinese Library Classification (CLC) number:

C93 [Management Science];

Discipline classification code:

12 ; 1201 ; 1202 ; 120202 ;

Abstract:

We present an update formula that allows the expression of the deviation matrix of a continuous-time Markov process with denumerable state space having generator matrix Q* through a continuous-time Markov process with generator matrix Q. We show that under suitable stability conditions the algorithm converges at a geometric rate. By applying the concept to three different examples, namely, the M/M/1 queue with vacations, the M/G/1 queue, and a tandem network, we illustrate the broad applicability of our approach. For a problem in admission control, we apply our approximation algorithm to Markov decision theory for computing the optimal control policy. Numerical examples are presented to highlight the efficiency of the proposed algorithm.
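
For orientation, the deviation matrix mentioned in the abstract can be illustrated on a small finite chain. The sketch below is not the update formula or the approximation algorithm of the paper, which targets denumerable state spaces. It assumes a finite, irreducible generator `Q` and uses the standard finite-state identity D = (Π − Q)^{-1} − Π, where Π is the matrix whose rows all equal the stationary distribution. The function names and the two-state example are hypothetical.

```python
import numpy as np

def stationary_distribution(Q):
    """Solve pi Q = 0 with sum(pi) = 1 for a finite irreducible generator Q."""
    n = Q.shape[0]
    # Stack the balance equations with the normalisation constraint.
    A = np.vstack([Q.T, np.ones(n)])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def deviation_matrix(Q):
    """Return (D, Pi) with D = (Pi - Q)^{-1} - Pi (finite-state case only)."""
    n = Q.shape[0]
    pi = stationary_distribution(Q)
    Pi = np.tile(pi, (n, 1))  # every row equals the stationary distribution
    D = np.linalg.inv(Pi - Q) - Pi
    return D, Pi

# Hypothetical two-state example: rate 1 from state 0 to 1, rate 2 back.
Q = np.array([[-1.0, 1.0],
              [2.0, -2.0]])
D, Pi = deviation_matrix(Q)
```

In this finite setting `D` satisfies `Q @ D = Pi - I` and `D @ Pi = 0`, which gives a quick sanity check on any numerical result.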


Pages: 918-932

Page count: 15
