Continuous-Time Markov Decision Processes with Exponential Utility

被引：28

作者：

Zhang, Yi ^{[1
]}

机构：

[1] Univ Liverpool, Dept Mat Sci, Liverpool L69 7ZL, Merseyside, England

来源：

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2017年 / 55卷 / 04期

关键词：

continuous-time Markov decision processes; exponential utility; total undiscounted criteria; risk-sensitive criterion; optimality equation; SEMI-MARKOV;

D O I：

10.1137/16M1086261

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we consider a continuous-time Markov decision process (CTMDP) in Borel spaces, where the certainty equivalent with respect to the exponential utility of the total undiscounted cost is to be minimized. The cost rate is nonnegative. We establish the optimality equation. Under the compactness-continuity condition, we show the existence of a deterministic stationary optimal policy. We reduce the risk-sensitive CTMDP problem to an equivalent risk sensitive discrete-time Markov decision process, which is with the same state and action spaces as the original CTMDP. In particular, the value iteration algorithm for the CTMDP problem follows from this reduction. We essentially do not need to impose a condition on the growth of the transition and cost rate in the state, and the controlled process could be explosive.

引用

下载

页码：2636 / 2660

页数：25

共 50 条

[1] ON GRADUAL-IMPULSE CONTROL OF CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH EXPONENTIAL UTILITY
Guo, Xin
Kurushima, Aiko
Piunovskiy, Alexey
Zhang, Yi
ADVANCES IN APPLIED PROBABILITY, 2021, 53 (02) : 301 - 334
[2] EXPONENTIAL CONVERGENCE IN UNDISCOUNTED CONTINUOUS-TIME MARKOV DECISION CHAINS
ZIJM, WHM
MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (04) : 700 - 717
[3] The Transformation Method for Continuous-Time Markov Decision Processes
Piunovskiy, Alexey
Zhang, Yi
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 154 (02) : 691 - 712
[4] Impulsive control for continuous-time Markov decision processes
Université Bordeaux, IMB, INRIA Bordeaux Sud-Ouest, 200 Avenue de la Vieille Tour, Talence Cedex
33405, France
不详
L69 7ZL, United Kingdom
Adv Appl Probab, 1 (106-127):
[5] IMPULSIVE CONTROL FOR CONTINUOUS-TIME MARKOV DECISION PROCESSES
Dufour, Francois
Piunovskiy, Alexei B.
ADVANCES IN APPLIED PROBABILITY, 2015, 47 (01) : 106 - 127
[6] Continuous-Time Markov Decision Processes with Controlled Observations
Huang, Yunhan
Kavitha, Veeraruna
Zhu, Quanyan
2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 32 - 39
[7] REALIZABLE STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES
Piunovskiy, Alexey
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2018, 56 (01) : 473 - 495
[8] The Transformation Method for Continuous-Time Markov Decision Processes
Alexey Piunovskiy
Yi Zhang
Journal of Optimization Theory and Applications, 2012, 154 : 691 - 712
[9] Delayed Nondeterminism in Continuous-Time Markov Decision Processes
Neuhaeusser, Martin R.
Stoelinga, Marielle
Katoen, Joost-Pieter
FOUNDATIONS OF SOFTWARE SCIENCE AND COMPUTATIONAL STRUCTURES, PROCEEDINGS, 2009, 5504 : 364 - +
[10] Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes
Feinberg, Eugene A.
Mandava, Manasa
Shiryaev, Albert N.
MATHEMATICS OF OPERATIONS RESEARCH, 2022, 47 (02) : 1266 - 1286

← 1 2 3 4 5 →