VALUE ITERATION IN COUNTABLE STATE AVERAGE COST MARKOV DECISION PROCESSES WITH UNBOUNDED COSTS

被引:14
|
作者
Sennott, Linn I. [1 ]
机构
[1] Illinois State Univ, Normal, IL 61761 USA
关键词
D O I
10.1007/BF02055585
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We deal with countable state Markov decision processes with finite action sets and (possibly) unbounded costs. Assuming the existence of an expected average cost optimal stationary policy f, with expected average cost g, when can f and g be found using undiscounted value iteration? We give assumptions guaranteeing the convergence of a quantity related to ng-v(n)(i), where v(n)(i) is the minimum expected n-stage cost when the process starts in state i. The theory is applied to a queueing system with variable service rates and to a queueing system with variable arrival parameter.
引用
收藏
页码:261 / 271
页数:11
相关论文
共 50 条