Semi-Markov decision processes with variance minimization criterion

被引:0
|
作者
Qingda Wei
Xianping Guo
机构
[1] Huaqiao University,School of Economics and Finance
[2] Sun Yat-Sen University,School of Mathematics and Computational Science
来源
4OR | 2015年 / 13卷
关键词
Semi-Markov decision processes; State-dependent discount factors; Discount optimality equation; Discount variance minimal policy; 93E20; 90C40;
D O I
暂无
中图分类号
学科分类号
摘要
We consider a variance minimization problem for semi-Markov decision processes with state-dependent discount factors in Borel spaces. The reward function may be unbounded from below and from above. Under suitable conditions, we first prove that the discount variance minimization criterion can be transformed into an equivalent expected discount criterion, and then show the existence of a discount variance minimal policy over the class of expected discount optimal stationary policies. Furthermore, we also give a value iteration algorithm for calculating the expected discount optimal value function. Finally, two examples are used to illustrate our results.
引用
收藏
页码:59 / 79
页数:20
相关论文
共 50 条