Risk-sensitive infinite-horizon discounted piecewise deterministic Markov decision processes

被引:0
|
作者
Yonghui Huang
Zhaotong Lian
Xianping Guo
机构
[1] Sun Yat-Sen University,School of Mathematics
[2] Faculty of Business Administration,undefined
[3] University of Macau,undefined
来源
Operational Research | 2022年 / 22卷
关键词
Piecewise deterministic Markov decision processes; Risk sensitive; Discounted cost; HJB equation; Non-stationarity; 90C40; 93E20;
D O I
暂无
中图分类号
学科分类号
摘要
This paper deals with risk-sensitive piecewise deterministic Markov decision processes, where the expected exponential utility of an infinite-horizon discounted cost is minimized. Both the transition rate and cost rate are allowed to be unbounded. Based on a dynamic programming observation, we introduce an auxiliary function with the time as an additional variable to analyze the problem, which is different from those with the risk-sensitive parameter as an additional variable in previous works. Under suitable assumptions, we derive the associated Feynman-Kac’s formula, and then establish the associated Hamilton–Jacobi–Bellman equation with the time as a differential variable, which leads to the existence of optimal policies depending on the time, explicitly showing that the risk-sensitive discounted optimal policies are not stationary.
引用
收藏
页码:5791 / 5816
页数:25
相关论文
共 50 条