ON GRADUAL-IMPULSE CONTROL OF CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH EXPONENTIAL UTILITY

Cited by: 2

Authors:

Guo, Xin [1]

Kurushima, Aiko [2]

Piunovskiy, Alexey [3]

Zhang, Yi [3]

Affiliations:

[1] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China

[2] Sophia Univ, Dept Econ, Chiyoda Ku, 7-1 Kioi Cho, Tokyo 1028554, Japan

[3] Univ Liverpool, Dept Math Sci, Liverpool L69 7ZL, Merseyside, England

Source:

ADVANCES IN APPLIED PROBABILITY | 2021 / Vol. 53 / No. 02

Funding:

Engineering and Physical Sciences Research Council (UK);

Keywords:

Continuous-time Markov decision processes; dynamic programming; gradual-impulse control; optimality equation; risk-sensitive control; cost;

DOI:

10.1017/apr.2020.64

Chinese Library Classification:

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Discipline classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions.
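
To illustrate the kind of discrete-time model the abstract refers to, the following is a minimal sketch, not the paper's construction. It assumes a finite toy model with an absorbing cost-free state, a risk-sensitivity coefficient of 1, and the standard multiplicative optimality equation V(x) = min_a exp(c(x, a)) * sum_y p(y | x, a) V(y) for the expected exponential utility of the total cost. The state names, costs, transition probabilities, and the function `multiplicative_value_iteration` are all hypothetical; each state's action set simply contains one "gradual" and one "impulse" action, mimicking a union of the two action sets.

```python
import math


def multiplicative_value_iteration(actions, cost, trans, absorbing, n_iter=500):
    """Iterate V(x) = min_a exp(cost[x, a]) * sum_y trans[x, a][y] * V(y).

    Absorbing states are cost-free, so their value is fixed at exp(0) = 1.
    """
    V = {x: 1.0 for x in actions}
    policy = {}
    for _ in range(n_iter):
        new_V = {}
        for x in actions:
            if x in absorbing:
                new_V[x] = 1.0
                continue
            best_a, best_v = None, math.inf
            for a in actions[x]:
                expected = sum(p * V[y] for y, p in trans[(x, a)].items())
                v = math.exp(cost[(x, a)]) * expected
                if v < best_v:
                    best_a, best_v = a, v
            new_V[x] = best_v
            policy[x] = best_a
        V = new_V
    return V, policy


# Hypothetical toy model: every non-absorbing state offers one "gradual"
# and one "impulse" action (all numbers are invented for illustration).
actions = {
    "low": ["gradual", "impulse"],
    "high": ["gradual", "impulse"],
    "done": [],
}
cost = {
    ("low", "gradual"): 0.1,
    ("low", "impulse"): 1.0,
    ("high", "gradual"): 2.0,
    ("high", "impulse"): 1.0,
}
trans = {
    ("low", "gradual"): {"low": 0.5, "done": 0.5},
    ("low", "impulse"): {"done": 1.0},
    ("high", "gradual"): {"high": 0.5, "done": 0.5},
    ("high", "impulse"): {"low": 1.0},
}

V, policy = multiplicative_value_iteration(actions, cost, trans, absorbing={"done"})
print(policy)
print(V)
```

In this toy instance the cheap gradual action is preferred in state "low", while in state "high" the gradual action is so costly that its exponential utility would blow up, so the impulse that moves the system to "low" is selected. The resulting policy is deterministic and stationary, which is the type of policy whose optimality the abstract asserts for the general problem.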

Pages: 301-334

Page count: 34
