Risk-sensitive finite-horizon piecewise deterministic Markov decision processes

被引:6
|
作者
Huang, Yonghui [1 ,3 ]
Lian, Zhaotong [2 ]
Guo, Xianping [1 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Math, Guangzhou 510275, Guangdong, Peoples R China
[2] Univ Macau, Fac Business Adm, Macau, Peoples R China
[3] Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Piecewise deterministic Markov decision processes; Risk sensitive; Finite horizon; Unbounded transition rates; HJB equation; Optimal policy;
D O I
10.1016/j.orl.2019.05.001
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper deals with risk-sensitive piecewise deterministic Markov decision processes, where the expected exponential utility of a finite horizon reward is to be maximized. Both the transition rates and reward functions are allowed to be unbounded. Feynman-Kac's formula is developed in our setup, using which along with an approximation technique, we establish the associated Hamilton-Jacobi-Bellman equation and the existence of risk-sensitive optimal policies under suitable conditions. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:96 / 103
页数:8
相关论文
共 50 条