Continuous-Time Controlled Markov Chains with Discounted Rewards

被引:0
|
作者
Xianping Guo
Onésimo Hernández-Lerma
机构
[1] Zhongshan University,The School of Mathematics and Computational Science
[2] CINVESTAV-IPN,Departamento de Matemáticas
来源
关键词
continuous-time controlled Markov chains; unbounded reward and transition rates; discounted criterion; optimal stationary policies; martingale characterization;
D O I
暂无
中图分类号
学科分类号
摘要
This paper studies denumerable state continuous-time controlled Markov chains with the discounted reward criterion and a Borel action space. The reward and transition rates are unbounded, and the reward rates are allowed to take positive or negative values. First, we present new conditions for a nonhomogeneous Q(t)-process to be regular. Then, using these conditions, we give a new set of mild hypotheses that ensure the existence of ∈-optimal (∈≥0) stationary policies. We also present a ‘martingale characterization’ of an optimal stationary policy. Our results are illustrated with controlled birth and death processes.
引用
收藏
页码:195 / 216
页数:21
相关论文
共 50 条