ON GRADUAL-IMPULSE CONTROL OF CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH EXPONENTIAL UTILITY

被引:2
|
作者
Guo, Xin [1 ]
Kurushima, Aiko [2 ]
Piunovskiy, Alexey [3 ]
Zhang, Yi [3 ]
机构
[1] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
[2] Sophia Univ, Dept Econ, Chiyoda Ku, 7-1 Kioi Cho, Tokyo 1028554, Japan
[3] Univ Liverpool, Dept Math Sci, Liverpool L69 72L, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
Continuous-time Markov decision processes; dynamic programming; gradual-impulse control; optimality equation; RISK-SENSITIVE CONTROL; COST;
D O I
10.1017/apr.2020.64
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value function using the optimality equation, we reduce the gradual-impulse control problem to an equivalent simple discrete-time Markov decision process, whose action space is the union of the sets of gradual and impulsive actions.
引用
收藏
页码:301 / 334
页数:34
相关论文
共 50 条
  • [31] Policy learning in continuous-time Markov decision processes using Gaussian Processes
    Bartocci, Ezio
    Bortolussi, Luca
    Brazdil, Tomas
    Milios, Dimitrios
    Sanguinetti, Guido
    PERFORMANCE EVALUATION, 2017, 116 : 84 - 100
  • [32] Discounted optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    2006 CHINESE CONTROL CONFERENCE, VOLS 1-5, 2006, : 1785 - 1787
  • [33] Average optimality for continuous-time Markov decision processes in Polish spaces
    Guo, Xianping
    Rieder, Ulrich
    ANNALS OF APPLIED PROBABILITY, 2006, 16 (02): : 730 - 756
  • [34] On continuous-time Markov processes in bargaining
    Houba, Harold
    ECONOMICS LETTERS, 2008, 100 (02) : 280 - 283
  • [35] ABSORBING CONTINUOUS-TIME MARKOV DECISION PROCESSES WITH TOTAL COST CRITERIA
    Guo, Xianping
    Vykertas, Mantas
    Zhang, Yi
    ADVANCES IN APPLIED PROBABILITY, 2013, 45 (02) : 490 - 519
  • [36] Denumerable continuous-time Markov decision processes with multiconstraints on average costs
    Liu, Qiuli
    Tan, Hangsheng
    Guo, Xianping
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2012, 43 (03) : 576 - 585
  • [37] DISCOUNTED CONTINUOUS-TIME CONSTRAINED MARKOV DECISION PROCESSES IN POLISH SPACES
    Guo, Xianping
    Song, Xinyuan
    ANNALS OF APPLIED PROBABILITY, 2011, 21 (05): : 2016 - 2049
  • [38] Variance minimization for continuous-time Markov decision processes: two approaches
    Quan-xin Zhu
    Applied Mathematics-A Journal of Chinese Universities, 2010, 25 : 400 - 410
  • [40] A survey of recent results on continuous-time Markov decision processes - Discussion
    Hu, Qiying
    TOP, 2006, 14 (02) : 248 - 251