Riemannian Laplace approximations for Bayesian neural networks

被引:0
|
作者
Bergamin, Federico [1 ]
Moreno-Munoz, Pablo [1 ]
Hauberg, Soren [1 ]
Arvanitidis, Georgios [1 ]
机构
[1] Tech Univ Denmark, DTU Compute, Sect Cognit Syst, Lyngby, Denmark
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian neural networks often approximate the weight-posterior with a Gaussian distribution. However, practical posteriors are often, even locally, highly non-Gaussian, and empirical performance deteriorates. We propose a simple parametric approximate posterior that adapts to the shape of the true posterior through a Riemannian metric that is determined by the log-posterior gradient. We develop a Riemannian Laplace approximation where samples naturally fall into weight-regions with low negative log-posterior. We show that these samples can be drawn by solving a system of ordinary differential equations, which can be done efficiently by leveraging the structure of the Riemannian metric and automatic differentiation. Empirically, we demonstrate that our approach consistently improves over the conventional Laplace approximation across tasks. We further show that, unlike the conventional Laplace approximation, our method is not overly sensitive to the choice of prior, which alleviates a practical pitfall of current approaches.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Riemannian batch normalization for SPD neural networks
    Brooks, Daniel
    Schwander, Olivier
    Barbaresco, Frederic
    Schneider, Jean-Yves
    Cord, Matthieu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [22] Bayesian Optimization with Robust Bayesian Neural Networks
    Springenberg, Jost Tobias
    Klein, Aaron
    Falkner, Stefan
    Hutter, Frank
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [23] Geodesic Convolutional Neural Networks on Riemannian Manifolds
    Masci, Jonathan
    Boscaini, Davide
    Bronstein, Michael M.
    Vandergheynst, Pierre
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 832 - 840
  • [24] Riemannian Local Mechanism for SPD Neural Networks
    Chen, Ziheng
    Xu, Tianyang
    Wu, Xiao-Jun
    Wang, Rui
    Huang, Zhiwu
    Kittler, Josef
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7104 - 7112
  • [25] Robust scalable initialization for Bayesian variational inference with multi-modal Laplace approximations
    Bridgman, Wyatt
    Jones, Reese E.
    Khalil, Mohammad
    PROBABILISTIC ENGINEERING MECHANICS, 2023, 74
  • [26] Bayesian inference in neural networks
    Marzban, C
    FIRST CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1998, : J25 - J30
  • [27] Bayesian inference in neural networks
    Paige, RL
    Butler, RW
    BIOMETRIKA, 2001, 88 (03) : 623 - 641
  • [28] Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations
    Rue, Havard
    Martino, Sara
    Chopin, Nicolas
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 319 - 392
  • [29] Fast estimation of expected information gains for Bayesian experimental designs based on Laplace approximations
    Long, Quan
    Scavino, Marco
    Tempone, Raul
    Wang, Suojin
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2013, 259 : 24 - 39
  • [30] Bayesian inference in neural networks
    Marzban, C
    14TH CONFERENCE ON PROBABILITY AND STATISTICS IN THE ATMOSPHERIC SCIENCES, 1998, : J97 - J102