Riemannian Laplace approximations for Bayesian neural networks

被引:0
|
作者
Bergamin, Federico [1 ]
Moreno-Munoz, Pablo [1 ]
Hauberg, Soren [1 ]
Arvanitidis, Georgios [1 ]
机构
[1] Tech Univ Denmark, DTU Compute, Sect Cognit Syst, Lyngby, Denmark
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian neural networks often approximate the weight-posterior with a Gaussian distribution. However, practical posteriors are often, even locally, highly non-Gaussian, and empirical performance deteriorates. We propose a simple parametric approximate posterior that adapts to the shape of the true posterior through a Riemannian metric that is determined by the log-posterior gradient. We develop a Riemannian Laplace approximation where samples naturally fall into weight-regions with low negative log-posterior. We show that these samples can be drawn by solving a system of ordinary differential equations, which can be done efficiently by leveraging the structure of the Riemannian metric and automatic differentiation. Empirically, we demonstrate that our approach consistently improves over the conventional Laplace approximation across tasks. We further show that, unlike the conventional Laplace approximation, our method is not overly sensitive to the choice of prior, which alleviates a practical pitfall of current approaches.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] Bayesian modelling of neural networks
    Mutihac, R
    Cicuttin, A
    Estrada, AC
    Colavita, AA
    FOUNDATIONS AND TOOLS FOR NEURAL MODELING, PROCEEDINGS, VOL I, 1999, 1606 : 277 - 286
  • [32] Spatial Bayesian neural networks
    Zammit-Mangion, Andrew
    Kaminski, Michael D.
    Tran, Ba-Hien
    Filippone, Maurizio
    Cressie, Noel
    SPATIAL STATISTICS, 2024, 60
  • [33] Bayesian Quantum Neural Networks
    Nguyen, Nam
    Chen, Kwang-Cheng
    IEEE ACCESS, 2022, 10 : 54110 - 54122
  • [34] Classification with Bayesian neural networks
    Neal, Radford M.
    MACHINE LEARNING CHALLENGES: EVALUATING PREDICTIVE UNCERTAINTY VISUAL OBJECT CLASSIFICATION AND RECOGNIZING TEXTUAL ENTAILMENT, 2006, 3944 : 28 - 32
  • [35] Universal Approximations of Invariant Maps by Neural Networks
    Yarotsky, Dmitry
    CONSTRUCTIVE APPROXIMATION, 2022, 55 (01) : 407 - 474
  • [36] Universal Approximations of Invariant Maps by Neural Networks
    Dmitry Yarotsky
    Constructive Approximation, 2022, 55 : 407 - 474
  • [37] CONSTRUCTIVE APPROXIMATIONS FOR NEURAL NETWORKS BY SIGMOIDAL FUNCTIONS
    JONES, LK
    PROCEEDINGS OF THE IEEE, 1990, 78 (10) : 1586 - 1589
  • [38] Bayesian Perceptron: Towards fully Bayesian Neural Networks
    Huber, Marco F.
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3179 - 3186
  • [39] Function Space Bayesian Pseudocoreset for Bayesian Neural Networks
    Kim, Balhae
    Lee, Hyungi
    Lee, Juho
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [40] A joint Bayesian framework for missing data and measurement error using integrated nested Laplace approximations
    Skarstein, Emma
    Martino, Sara
    Muff, Stefanie
    BIOMETRICAL JOURNAL, 2023, 65 (08)