Convergence. of controlled models and finite-state approximation for discounted continuous-time Markov decision processes with constraints

被引:19
|
作者
Guo, Xianping [1 ]
Zhang, Wenzhao [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Math & Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
关键词
Constrained continuous-time Markov decision processes; Unbounded transition rate; Convergence; Finite approximation;
D O I
10.1016/j.ejor.2014.03.037
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
In this paper we consider the convergence of a sequence {M-n} of the models of discounted continuoustime constrained Markov decision processes (MDP) to the "limit" one, denoted by M-infinity. For the models with denumerable states and unbounded transition rates, under reasonably mild conditions we prove that the (constrained) optimal policies and the optimal values of {M-n} converge to those of M-infinity, respectively, using a technique of occupation measures. As an application of the convergence result developed here, we show that an optimal policy and the optimal value for countable-state continuous-time MDP can be approximated by those of finite-state continuous-time MDP. Finally, we further illustrate such finite-state approximation by solving numerically a controlled birth-and-death system and also give the corresponding error bound of the approximation. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:486 / 496
页数:11
相关论文
共 50 条