A survey of recent results on continuous-time Markov decision processes

Cited by: 56

Authors:

Guo, Xianping [1]

Hernandez-Lerma, Onesimo [1]

Prieto-Rumeau, Tomas [1]

Affiliations:

[1] Zhongshan Univ, Beijing, Peoples R China

Source:

TOP | 2006 / Vol. 14 / No. 02

Keywords:

continuous-time Markov decision processes (also known as controlled Markov chains); unbounded reward and transition rates; discounted reward; average reward; bias optimality; sensitive discount criteria;

DOI:

10.1007/BF02837562

Chinese Library Classification (CLC) number:

C93 [Management Science]; O22 [Operations Research];

Discipline classification code:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

This paper is a survey of recent results on continuous-time Markov decision processes (MDPs) with unbounded transition rates, and reward rates that may be unbounded from above and from below. These results pertain to discounted and average reward optimality criteria, which are the most commonly used criteria, and also to more selective concepts, such as bias optimality and sensitive discount criteria. For concreteness, we consider only MDPs with a countable state space, but we indicate how the results can be extended to more general MDPs or to Markov games.


Pages: 177-243

Page count: 67

