Asymptotic variance of passage time estimators in Markov chains

被引:0
|
作者
Zazanis, Michael A. [1 ]
机构
[1] Athens Univ Econ & Business, Dept Stat, Athens 10434, Greece
关键词
D O I
10.1017/S0269964807070143
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We consider the problem of estimating passage times in stochastic simulations of Markov chains. Two types of estimator are considered for this purpose: the "simple" and the "overlapping" estimator; they are compared in terms of their asymptotic variance. The analysis is based on the regenerative structure of the process and it is shown that when estimating the mean passage time, the simple estimator is always asymptotically superior. However, when the object is to estimate the expectation of a nonlinear function of the passage time, such as the probability that the passage time exceeds a given threshold, then it is shown that the overlapping estimator can be superior in some cases. Related results in the Reinforcement Learning literature are discussed.
引用
收藏
页码:217 / 234
页数:18
相关论文
共 50 条