Performance optimization of semi-Markov decision processes with discounted-cost criteria

被引：3

作者：

Yin, Baoqun ^{[1
]}

Li, Yanjie ^{[1
]}

Zhou, Yaping ^{[1
]}

Xi, Hongsheng ^{[1
]}

机构：

[1] Univ Sci & Technol China, Dept Automat, Hefei 230026, Anhui, Peoples R China

来源：

EUROPEAN JOURNAL OF CONTROL | 2008年 / 14卷 / 03期

关键词：

semi-Markov decision processes; discounted Poisson equation; alpha-potential; discounted-cost criteria; policy iteration; value iteration;

D O I：

10.3166/EJC.14.213-222

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We discuss the problems of discounted-cost performance optimization for a class of semi-Markov decision processes (SMDPs). We define a matrix which can be used as the infinitesimal generator of a Markov process. The discounted Poisson equation is proposed for an SMDP by using this matrix, from which the alpha-potential is defined. The optimally equation satisfied by the optimal stationary policy is given and the relation between discounted model and average model is discussed. Two iteration algorithms to find is an element of-optimal policies are proposed and the proofs of convergence of these two algorithms are given. A numerical example is provided to illustrate the application of the algorithms.

引用

页码：213 / 222

页数：10

共 50 条

[1] Constrained discounted semi-Markov decision processes
Feinberg, EA
[J]. MARKOV PROCESSES AND CONTROLLED MARKOV CHAINS, 2002, : 233 - 244
[2] A Unified Approach for Semi-Markov Decision Processes with Discounted and Average Reward Criteria
Li, Yanjie
Wang, Huijing
Chen, Haoyao
[J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1741 - 1744
[3] Customizing exponential semi-Markov decision processes under the discounted cost criterion
Cekyay, Bora
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 266 (01) : 168 - 178
[4] Mixed Markov decision processes in a semi-Markov environment with discounted criterion
Hu, QY
Wang, JL
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1998, 219 (01) : 1 - 20
[5] SEMI-MARKOV DECISION-PROCESSES WITH INCOMPLETE STATE OBSERVATION - DISCOUNTED COST CRITERION
WAKUTA, K
[J]. JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 1982, 25 (04) : 351 - 362
[6] A SECONDARY APPROACH TO THE DISCOUNTED MODEL IN SEMI-MARKOV DECISION PROCESSES
董泽清
宋京生
[J]. Science Bulletin, 1988, (06) : 448 - 454
[7] Semi-markov decision processes nonstandard criteria
Baykal-Guersoy, M.
Guersoy, K.
[J]. PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2007, 21 (04) : 635 - 657
[8] AVERAGE COST SEMI-MARKOV DECISION PROCESSES
ROSS, SM
[J]. JOURNAL OF APPLIED PROBABILITY, 1970, 7 (03) : 649 - &
[9] A SECONDARY APPROACH TO THE DISCOUNTED MODEL IN SEMI-MARKOV DECISION-PROCESSES
DONG, ZQ
SONG, JS
[J]. KEXUE TONGBAO, 1988, 33 (06): : 448 - 454
[10] Nonstationary continuous time Markov decision processes in a semi-Markov environment with discounted criterion
[J]. J Math Anal Appl, 3 (640):

← 1 2 3 4 5 →