STRONG AVERAGE OPTIMALITY CRITERION FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES

被引：0

作者：

Wei, Qingda ^{[1
]}

Chen, Xian ^{[2
]}

机构：

[1] Huaqiao Univ, Sch Econ & Finance, Quanzhou 362021, Peoples R China

[2] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China

来源：

KYBERNETIKA | 2014年 / 50卷 / 06期

关键词：

continuous-time Markov decision processes; strong average optimality criterion; finite-horizon expected total cost criterion; unbounded transition rates; optimal policy; optimal value function;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with continuous-time Markov decision processes with the unbounded transition rates under the strong average cost criterion. The state and action spaces are Borel spaces, and the costs are allowed to be unbounded from above and from below. Under mild conditions, we first prove that the finite-horizon optimal value function is a solution to the optimality equation for the case of uncountable state spaces and unbounded transition rates, and that there exists an optimal deterministic Markov policy. Then, using the two average optimality inequalities, we show that the set of all strong average optimal policies coincides with the set of all average optimal policies, and thus obtain the existence of strong average optimal policies. Furthermore, employing the technique of the skeleton chains of controlled continuous-time Markov chains and Chapman Kolmogorov equation, we give a new set of sufficient conditions imposed on the primitive data of the model for the verification of the uniform exponential ergodicity of continuous-time Markov chains governed by stationary policies. Finally, we illustrate our main results with an example.

引用

页码：950 / 977

页数：28

共 50 条

[1] A note on optimality conditions for continuous-time Markov decision processes with average cost criterion
Guo, XP
Liu, K
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2001, 46 (12) : 1984 - 1989
[2] Average optimality for continuous-time Markov decision processes in Polish spaces
Guo, Xianping
Rieder, Ulrich
[J]. ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
[3] Verifiable conditions for average optimality of continuous-time Markov decision processes
Zou, Xiaolong
Huang, Yonghui
[J]. OPERATIONS RESEARCH LETTERS, 2016, 44 (06) : 742 - 746
[4] New sufficient conditions for average optimality in continuous-time Markov decision processes
Ye, Liuer
Guo, Xianping
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2010, 72 (01) : 75 - 94
[5] Average optimality inequality for continuous-time Markov decision processes in Polish spaces
Quanxin Zhu
[J]. Mathematical Methods of Operations Research, 2007, 66 : 299 - 313
[6] New sufficient conditions for average optimality in continuous-time Markov decision processes
Liuer Ye
Xianping Guo
[J]. Mathematical Methods of Operations Research, 2010, 72 : 75 - 94
[7] Optimality of Mixed Policies for Average Continuous-Time Markov Decision Processes with Constraints
Guo, Xianping
Zhang, Yi
[J]. MATHEMATICS OF OPERATIONS RESEARCH, 2016, 41 (04) : 1276 - 1296
[8] NEW DISCOUNT AND AVERAGE OPTIMALITY CONDITIONS FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
Guo, Xianping
Ye, Liuer
[J]. ADVANCES IN APPLIED PROBABILITY, 2010, 42 (04) : 953 - 985
[9] Average optimality for continuous-time Markov decision processes with a policy iteration approach
Zhu, Quanxin
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2008, 339 (01) : 691 - 704
[10] Average optimality inequality for continuous-time Markov decision processes in Polish spaces
Zhu, Quanxin
[J]. MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2007, 66 (02) : 299 - 313

← 1 2 3 4 5 →