On the speed of convergence of value iteration on stochastic shortest-path problems

被引:15
|
作者
Bonet, Blai [1 ]
机构
[1] Univ Simon Bolivar, Dept Computac, Caracas 89000, Venezuela
关键词
markov decision processes; stochastic shortest-path problems; value iteration;
D O I
10.1287/moor.1060.0238
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We establish a bound on the convergence time of the value iteration algorithm on stochastic shortest-path problems. The bound. which applies for admissible initial vectors as, for example, J 0, implies a polynomial-time convergence of value iteration for all problems with polynormally bounded ||J*||/g. This result gives a partial answer to the open problem of bounding the convergence time of value iteration on arbitrary initial vectors. The proof is obtained by analyzing a stochastic process associated with the shortest-path problem.
引用
收藏
页码:365 / 373
页数:9
相关论文
共 50 条