Stochastic gradient Hamiltonian Monte Carlo with variance reduction for Bayesian inference

被引：1

作者：

Zhize Li

Tianyi Zhang

Shuyu Cheng

Jun Zhu

Jian Li

机构：

[1] Tsinghua University,

来源：

Machine Learning | 2019年 / 108卷

关键词：

Hamiltonian Monte Carlo; Variance reduction; Bayesian inference;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Gradient-based Monte Carlo sampling algorithms, like Langevin dynamics and Hamiltonian Monte Carlo, are important methods for Bayesian inference. In large-scale settings, full-gradients are not affordable and thus stochastic gradients evaluated on mini-batches are used as a replacement. In order to reduce the high variance of noisy stochastic gradients, Dubey et al. (in: Advances in neural information processing systems, pp 1154–1162, 2016) applied the standard variance reduction technique on stochastic gradient Langevin dynamics and obtained both theoretical and experimental improvements. In this paper, we apply the variance reduction tricks on Hamiltonian Monte Carlo and achieve better theoretical convergence results compared with the variance-reduced Langevin dynamics. Moreover, we apply the symmetric splitting scheme in our variance-reduced Hamiltonian Monte Carlo algorithms to further improve the theoretical results. The experimental results are also consistent with the theoretical results. As our experiment shows, variance-reduced Hamiltonian Monte Carlo demonstrates better performance than variance-reduced Langevin dynamics in Bayesian regression and classification tasks on real-world datasets.

引用

页码：1701 / 1727

页数：26

共 50 条

[41] Separable Shadow Hamiltonian Hybrid Monte Carlo for Bayesian Neural Network Inference in wind speed forecasting
Mbuvha, Rendani
Mongwe, Wilson Tsakane
Marwala, Tshilidzi
[J]. ENERGY AND AI, 2021, 6
[42] Monte Carlo transition dynamics and variance reduction
Fitzgerald, M
Picard, RR
Silver, RN
[J]. JOURNAL OF STATISTICAL PHYSICS, 2000, 98 (1-2) : 321 - 345
[43] Variance reduction for multivariate Monte Carlo simulation
Wang, Jr-Yan
[J]. JOURNAL OF DERIVATIVES, 2008, 16 (01): : 7 - 28
[44] Monte Carlo Transition Dynamics and Variance Reduction
M. Fitzgerald
R. R. Picard
R. N. Silver
[J]. Journal of Statistical Physics, 2000, 98 : 321 - 345
[45] Stochastic Gradient Descent as Approximate Bayesian Inference
Mandt, Stephan
Hoffman, Matthew D.
Blei, David M.
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
[46] Some adaptive Monte Carlo methods for Bayesian inference
Tierney, L
Mira, A
[J]. STATISTICS IN MEDICINE, 1999, 18 (17-18) : 2507 - 2515
[47] Reflections on Bayesian inference and Markov chain Monte Carlo
Craiu, Radu, V
Gustafson, Paul
Rosenthal, Jeffrey S.
[J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (04): : 1213 - 1227
[48] Stochastic Gradient Markov Chain Monte Carlo
Nemeth, Christopher
Fearnhead, Paul
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (533) : 433 - 450
[49] Bayesian inference, Monte Carlo sampling and operational risk
Peters, G. W.
Sisson, S. A.
[J]. JOURNAL OF OPERATIONAL RISK, 2006, 1 (03): : 27 - 50
[50] Some adaptive Monte Carlo methods for Bayesian inference
Tierney, L
[J]. MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 552 - 552

← 1 2 3 4 5 →