Continuous-Time Controlled Markov Chains with Discounted Rewards

被引：0

作者：

Xianping Guo

Onésimo Hernández-Lerma

机构：

[1] Zhongshan University,The School of Mathematics and Computational Science

[2] CINVESTAV-IPN,Departamento de Matemáticas

来源：

Acta Applicandae Mathematica | 2003年 / 79卷

关键词：

continuous-time controlled Markov chains; unbounded reward and transition rates; discounted criterion; optimal stationary policies; martingale characterization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper studies denumerable state continuous-time controlled Markov chains with the discounted reward criterion and a Borel action space. The reward and transition rates are unbounded, and the reward rates are allowed to take positive or negative values. First, we present new conditions for a nonhomogeneous Q(t)-process to be regular. Then, using these conditions, we give a new set of mild hypotheses that ensure the existence of ∈-optimal (∈≥0) stationary policies. We also present a ‘martingale characterization’ of an optimal stationary policy. Our results are illustrated with controlled birth and death processes.

引用

页码：195 / 216

页数：21

共 50 条

[1] Continuous-time controlled Markov chains with discounted rewards
Guo, XP
Hernández-Lerma, O
[J]. ACTA APPLICANDAE MATHEMATICAE, 2003, 79 (03) : 195 - 216
[2] DISCOUNTED CONTINUOUS-TIME CONTROLLED MARKOV CHAINS: CONVERGENCE OF CONTROL MODELS
Prieto-Rumeau, Tomas
Hernandez-Lerma, Onesimo
[J]. JOURNAL OF APPLIED PROBABILITY, 2012, 49 (04) : 1072 - 1090
[3] Continuous-time controlled Markov chains
Guo, XP
Hernández-Lerma, O
[J]. ANNALS OF APPLIED PROBABILITY, 2003, 13 (01): : 363 - 388
[4] Continuous-time Markov decision processes with discounted rewards: The case of Polish spaces
Guo, Xianping
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2007, 32 (01) : 73 - 87
[5] Bias optimality for continuous-time controlled Markov chains
Prieto-Rumeau, T
Hernández-Lerma, O
[J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2006, 45 (01) : 51 - 73
[6] Continuous-time fuzzy decision processes with discounted rewards
Yoshida, Y
[J]. FUZZY SETS AND SYSTEMS, 2003, 139 (02) : 333 - 348
[7] Nonzero-sum games for continuous-time Markov chains with unbounded discounted payoffs
Guo, XP
Hernández-Lerma, O
[J]. JOURNAL OF APPLIED PROBABILITY, 2005, 42 (02) : 303 - 320
[8] Continuous-time controlled Markov chains with safety upper bound
Hsu, S. -P.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2011, 5 (02): : 397 - 401
[9] Blackwell Optimality in the Class of Markov Policies for Continuous-Time Controlled Markov Chains
Tomás Prieto-Rumeau
[J]. Acta Applicandae Mathematica, 2006, 92 : 77 - 96
[10] Blackwell optimality in the class of markov policies for continuous-time controlled markov chains
Prieto-Rumeau, Tomas
[J]. ACTA APPLICANDAE MATHEMATICAE, 2006, 92 (01) : 77 - 96

← 1 2 3 4 5 →