Variational method for estimating the rate of convergence of Markov-chain Monte Carlo algorithms

被引:8
|
作者
Casey, Fergal P. [1 ,6 ]
Waterfall, Joshua J. [2 ]
Gutenkunst, Ryan N. [3 ]
Myers, Christopher R. [4 ]
Sethna, James P. [5 ]
机构
[1] Univ Coll Dublin, Complex & Adapt Syst Lab, Dublin 4, Ireland
[2] Cornell Univ, Dept Mol Biol & Genet, Ithaca, NY 14853 USA
[3] Cornell Univ, Dept Biol Stat & Computat Biol, Ithaca, NY 14853 USA
[4] Cornell Univ, Computat Biol Serv Unit, Life Sci Core Labs Ctr, Ithaca, NY 14853 USA
[5] Cornell Univ, Atom & Solid State Phys Lab, Ithaca, NY 14853 USA
[6] Univ Coll Dublin, Conway Inst Biomol & Biomed Res, Dublin 4, Ireland
来源
PHYSICAL REVIEW E | 2008年 / 78卷 / 04期
关键词
D O I
10.1103/PhysRevE.78.046704
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
We demonstrate the use of a variational method to determine a quantitative lower bound on the rate of convergence of Markov chain Monte Carlo (MCMC) algorithms as a function of the target density and proposal density. The bound relies on approximating the second largest eigenvalue in the spectrum of the MCMC operator using a variational principle and the approach is applicable to problems with continuous state spaces. We apply the method to one dimensional examples with Gaussian and quartic target densities, and we contrast the performance of the random walk Metropolis-Hastings algorithm with a "smart" variant that incorporates gradient information into the trial moves, a generalization of the Metropolis adjusted Langevin algorithm. We find that the variational method agrees quite closely with numerical simulations. We also see that the smart MCMC algorithm often fails to converge geometrically in the tails of the target density except in the simplest case we examine, and even then care must be taken to choose the appropriate scaling of the deterministic and random parts of the proposed moves. Again, this calls into question the utility of smart MCMC in more complex problems. Finally, we apply the same method to approximate the rate of convergence in multidimensional Gaussian problems with and without importance sampling. There we demonstrate the necessity of importance sampling for target densities which depend on variables with a wide range of scales.
引用
收藏
页数:12
相关论文
共 50 条