Average cost criterion induced by the regular utility function for continuous-time Markov decision processes

被引:5
|
作者
Wei, Qingda [1 ]
Chen, Xian [2 ]
机构
[1] Huaqiao Univ, Sch Econ & Finance, Quanzhou 362021, Peoples R China
[2] Xiamen Univ, Sch Math Sci, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Continuous-time Markov decision processes; Regular utility function; U-average cost criterion; Optimal policy; RISK-SENSITIVE CONTROL; OPTIMALITY;
D O I
10.1007/s10626-017-0237-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study the average cost criterion induced by the regular utility function (U-average cost criterion) for continuous-time Markov decision processes. This criterion is a generalization of the risk-sensitive average cost and expected average cost criteria. We first introduce an auxiliary risk-sensitive first passage optimization problem and obtain the properties of the corresponding optimal value function under the slight conditions. Then we show that the pair of the optimal value functions of the risk-sensitive average cost criterion and the risk-sensitive first passage criterion is a solution to the optimality equation of the risk-sensitive average cost criterion allowing the risk-sensitivity parameter to take any nonzero value. Moreover, we have that the optimal value function of the risk-sensitive average cost criterion is continuous with respect to the risk-sensitivity parameter. Finally, we give the connections between the U-average cost criterion and the average cost criteria induced by the identity function and the exponential utility function, and prove the existence of a U-average optimal deterministic stationary policy in the class of all randomized Markov policies.
引用
收藏
页码:501 / 524
页数:24
相关论文
共 50 条