ON GRADUAL-IMPULSE CONTROL OF CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH EXPONENTIAL UTILITY

被引：2

作者：

Guo, Xin ^{[1
]}

Kurushima, Aiko ^{[2
]}

Piunovskiy, Alexey ^{[3
]}

Zhang, Yi ^{[3
]}

机构：

[1] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China

[2] Sophia Univ, Dept Econ, Chiyoda Ku, 7-1 Kioi Cho, Tokyo 1028554, Japan

[3] Univ Liverpool, Dept Math Sci, Liverpool L69 72L, Merseyside, England

来源：

ADVANCES IN APPLIED PROBABILITY | 2021年 / 53卷 / 02期

基金：

英国工程与自然科学研究理事会;

关键词：

Continuous-time Markov decision processes; dynamic programming; gradual-impulse control; optimality equation; RISK-SENSITIVE CONTROL; COST;

D O I：

10.1017/apr.2020.64

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions.

引用

页码：301 / 334

页数：34

共 50 条

[21] Bisimulation and logical preservation for continuous-time Markov decision processes
Neuhaeusser, Martin R.
Katoen, Joost-Pieter
CONCUR 2007 - CONCURRENCY THEORY, PROCEEDINGS, 2007, 4703 : 412 - +
[22] Bisimulations and Logical Characterizations on Continuous-Time Markov Decision Processes
Song, Lei
Zhang, Lijun
Godskesen, Jens Chr.
VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION: (VMCAI 2014), 2014, 8318 : 98 - 117
[23] Bias optimality for multichain continuous-time Markov decision processes
Guo, Xianping
Song, XinYuan
Zhang, Junyu
OPERATIONS RESEARCH LETTERS, 2009, 37 (05) : 317 - 321
[24] A survey of recent results on continuous-time Markov decision processes
Guo, Xianping
Hernandez-Lerma, Onesimo
Prieto-Rumeau, Tomas
TOP, 2006, 14 (02) : 177 - 243
[25] RANDOMIZED AND RELAXED STRATEGIES IN CONTINUOUS-TIME MARKOV DECISION PROCESSES
Piunovskiy, Alexey
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2015, 53 (06) : 3503 - 3533
[26] Constrained total undiscounted continuous-time Markov decision processes
Guo, Xianping
Zhang, Yi
BERNOULLI, 2017, 23 (03) : 1694 - 1736
[27] Constrained Continuous-Time Markov Decision Processes on the Finite Horizon
Guo, Xianping
Huang, Yonghui
Zhang, Yi
APPLIED MATHEMATICS AND OPTIMIZATION, 2017, 75 (02): : 317 - 341
[28] A survey of recent results on continuous-time Markov decision processes
Xianping Guo
Onésimo Hernández-Lerma
Tomás Prieto-Rumeau
Xi-Ren Cao
Junyu Zhang
Qiying Hu
Mark E. Lewis
Ricardo Vélez
TOP, 2006, 14 : 177 - 261
[29] A characterization of meaningful schedulers for continuous-time Markov decision processes
Wolovick, Nicolas
Johr, Sven
FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, 2006, 4202 : 352 - 367
[30] Optimal control of average reward constrained continuous-time finite Markov Decision Processes
Feinberg, EA
PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 3805 - 3810

← 1 2 3 4 5 →