Conditions for the existence of the forecast horizons in discounted Markov decision processes.

Cited by: 0
Authors
Cruz-Suárez, D [1]
Affiliation
[1] Univ Juarez Autonoma Tabasco, Div Acad Ciencias Basicas, Cunduacan 86690, Tabasco, Mexico
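Keywords
Discounted Markov decision processes; forecast horizon; dynamic programming equation; value iteration
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract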
This paper deals with stationary Markov Decision Processes (MDPs) with finite and infinite horizons under the expected total discounted cost criterion (see [1], [6] and [7]). For simplicity, we assume that both the state space X and the action space A are finite. We denote a discounted MDP with an infinite horizon by M_∞, and a discounted MDP with a finite horizon by M_n, where n is a positive integer. For M_∞, we present a condition (see [5]) that ensures the existence of a positive integer N*, called the Forecast Horizon (FH). Knowing that N* exists reduces the problem of determining an optimal policy for M_∞ to that of determining an optimal policy for M_N*.
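As a rough illustration of the idea in the abstract, the following Python sketch runs value iteration on a small finite MDP and reports the stage from which the greedy first-stage policy of M_n stops changing within the computed range. It is only a sketch under stated assumptions: the two-state example, the zero terminal cost, and the helper names (`value_iteration_step`, `greedy_policies`, `empirical_horizon`) are hypothetical and are not taken from the paper, and the condition from [5] is not reproduced here. Observed stabilization over finitely many stages suggests a forecast horizon but does not prove one.

```python
from typing import List, Tuple

Costs = List[List[float]]               # cost[x][a]
Transitions = List[List[List[float]]]   # trans[x][a][y] = P(y | x, a)


def value_iteration_step(v: List[float], cost: Costs, trans: Transitions,
                         alpha: float) -> Tuple[List[float], List[int]]:
    """One dynamic-programming step: returns (new values, greedy actions)."""
    new_v, policy = [], []
    for x in range(len(cost)):
        # Expected discounted cost of each action in state x.
        q = [cost[x][a] + alpha * sum(p * v[y] for y, p in enumerate(trans[x][a]))
             for a in range(len(cost[x]))]
        best = min(range(len(q)), key=q.__getitem__)
        new_v.append(q[best])
        policy.append(best)
    return new_v, policy


def greedy_policies(cost: Costs, trans: Transitions, alpha: float,
                    max_n: int) -> List[List[int]]:
    """Greedy first-stage policy of M_n for n = 1..max_n (assumed zero terminal cost)."""
    v = [0.0] * len(cost)
    policies = []
    for _ in range(max_n):
        v, policy = value_iteration_step(v, cost, trans, alpha)
        policies.append(policy)
    return policies


def empirical_horizon(policies: List[List[int]]) -> int:
    """Smallest n (1-based) from which the policy is constant up to the last computed stage."""
    n = len(policies)
    while n > 1 and policies[n - 2] == policies[-1]:
        n -= 1
    return n


# Hypothetical example: staying in state 0 costs 1 per stage,
# moving to the cost-free absorbing state 1 costs 2 once.
COST = [[1.0, 2.0], [0.0, 0.5]]
TRANS = [
    [[1.0, 0.0], [0.0, 1.0]],  # state 0: action 0 stays, action 1 moves to state 1
    [[0.0, 1.0], [1.0, 0.0]],  # state 1: action 0 stays, action 1 moves to state 0
]

policies = greedy_policies(COST, TRANS, alpha=0.9, max_n=20)
print(empirical_horizon(policies))  # prints 3
```

In this toy example the greedy policy for M_1 and M_2 keeps the process in state 0, while from n = 3 onward it moves to state 1, which is also the better choice over an infinite horizon (a discounted cost of 2 instead of 10).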
Pages: 207-211
Page count: 5