Optimal stopping time on discounted semi-Markov processes

被引：0

作者：

Fang Chen

Xianping Guo

Zhong-Wei Liao

机构：

[1] Sun Yat-Sen University,School of Mathematics

[2] Beijing Normal University,College of Education for the Future

来源：

Frontiers of Mathematics in China | 2021年 / 16卷

关键词：

Optimal stopping time; semi-Markov processes (SMPs); value function; semi-Markov decision processes (SMDPs); optimal policy; iterative algorithm; 90C40; 93E20; 60G40;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper attempts to study the optimal stopping time for semi-Markov processes (SMPs) under the discount optimization criteria with unbounded cost rates. In our work, we introduce an explicit construction of the equivalent semi-Markov decision processes (SMDPs). The equivalence is embodied in the expected discounted cost functions of SMPs and SMDPs, that is, every stopping time of SMPs can induce a policy of SMDPs such that the value functions are equal, and vice versa. The existence of the optimal stopping time of SMPs is proved by this equivalence relation. Next, we give the optimality equation of the value function and develop an effective iterative algorithm for computing it. Moreover, we show that the optimal and ε-optimal stopping time can be characterized by the hitting time of the special sets. Finally, to illustrate the validity of our results, an example of a maintenance system is presented in the end.

引用

页码：303 / 324

页数：21

共 50 条

[31] Customizing exponential semi-Markov decision processes under the discounted cost criterion
Cekyay, Bora
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 266 (01) : 168 - 178
[32] Using Semi-Markov Chains to Solve Semi-Markov Processes
Bei Wu
Brenda Ivette Garcia Maya
Nikolaos Limnios
Methodology and Computing in Applied Probability, 2021, 23 : 1419 - 1431
[33] A RANDOM TIME CHANGE RELATING SEMI-MARKOV AND MARKOV PROCESSES
YACKEL, J
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (02): : 358 - &
[34] COMPARISON OF SEMI-MARKOV AND MARKOV PROCESSES
KURTZ, TG
ANNALS OF MATHEMATICAL STATISTICS, 1971, 42 (03): : 991 - &
[35] Partially observable semi-Markov games with discounted payoff
Ghosh, Mrinal K.
Goswami, Anindya
STOCHASTIC ANALYSIS AND APPLICATIONS, 2006, 24 (05) : 1035 - 1059
[36] Semi-Markov control processes with unknown holding times distribution under a discounted criterion
Luque-Vásquez, F
Minjárez-Sosa, JA
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2005, 61 (03) : 455 - 468
[37] Correction to: Using Semi-Markov Chains to Solve Semi-Markov Processes
Bei Wu
Brenda Ivette Garcia Maya
Nikolaos Limnios
Methodology and Computing in Applied Probability, 2021, 23 (4) : 1433 - 1434
[38] Semi-Markov control processes with unknown holding times distribution under a discounted criterion
Fernando Luque-Vásquez
J. Adolfo Minjárez-Sosa
Mathematical Methods of Operations Research, 2005, 61 : 455 - 468
[39] NONSTATIONARY VALUE-ITERATION AND ADAPTIVE-CONTROL OF DISCOUNTED SEMI-MARKOV PROCESSES
HERNANDEZLERMA, O
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1985, 112 (02) : 435 - 445
[40] A UNIFIED APPROACH TO ALGORITHMS WITH A SUBOPTIMALITY TEST IN DISCOUNTED SEMI-MARKOV DECISION-PROCESSES
OHNO, K
JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1981, 24 (04) : 296 - 324

← 1 2 3 4 5 →