Subspace Inference for Bayesian Deep Learning

被引:0
|
作者
Izmailov, Pavel [1 ]
Maddox, Wesley J. [1 ]
Kirichenko, Polina [1 ]
Garipov, Timur [4 ]
Vetrov, Dmitry [2 ,3 ]
Wilson, Andrew Gordon [1 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Higher Sch Econ, Moscow, Russia
[3] Samsung HSE Lab, Moscow, Russia
[4] Samsung AI Ctr Moscow, Moscow, Russia
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian inference was once a gold standard for learning with neural networks, providing accurate full predictive distributions and well calibrated uncertainty. However, scaling Bayesian inference techniques to deep neural networks is challenging due to the high dimensionality of the parameter space. In this paper, we construct low-dimensional subspaces of parameter space, such as the first principal components of the stochastic gradient descent (SGD) trajectory, which contain diverse sets of high performing models. In these subspaces, we are able to apply elliptical slice sampling and variational inference, which struggle in the full parameter space. We show that Bayesian model averaging over the induced posterior in these subspaces produces accurate predictions and well-calibrated predictive uncertainty for both regression and image classification.
引用
收藏
页码:1169 / 1179
页数:11
相关论文
共 50 条
  • [31] Unsupervised Deep Learning for Subspace Clustering
    Sekmen, Ali
    Koku, Ahmet Bugra
    Parlaktuna, Mustafa
    Abdul-Malek, Ayad
    Vanamala, Nagendrababu
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2089 - 2094
  • [32] Neuronal integration of dynamic sources: Bayesian learning and Bayesian inference
    Siegelmann, Hava T.
    Holzman, Lars E.
    CHAOS, 2010, 20 (03)
  • [33] Deep Learning Type Inference
    Hellendoorn, Vincent J.
    Bird, Christian
    Barr, Earl T.
    Allamanis, Miltiadis
    ESEC/FSE'18: PROCEEDINGS OF THE 2018 26TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2018, : 152 - 162
  • [34] Instance Specific Metric Subspace Learning: A Bayesian Approach
    Ye, Han-Jia
    Zhan, De-Chuan
    Jiang, Yuan
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2272 - 2278
  • [35] Subspace Constrained Variational Bayesian Inference for Structured Compressive Sensing With a Dynamic Grid
    Liu, An
    Zhou, Yufan
    Xu, Wenkang
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2025, 73 : 781 - 794
  • [36] Impact-force identification using deep learning and Bayesian inference with application on pipeline structures
    Hamadneh, Mu'men
    Mustapha, Samir
    Fakih, Mohammad Ali
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2024,
  • [37] Deep-learning-enabled Bayesian inference of fuel magnetization in magnetized liner inertial fusion
    Lewis, William E.
    Knapp, Patrick F.
    Slutz, Stephen A.
    Schmit, Paul F.
    Chandler, Gordon A.
    Gomez, Matthew R.
    Harvey-Thompson, Adam J.
    Mangan, Michael A.
    Ampleford, David J.
    Beckwith, Kristian
    PHYSICS OF PLASMAS, 2021, 28 (09)
  • [38] Deep learning for population size history inference: Design, comparison and combination with approximate Bayesian computation
    Sanchez, Theophile
    Cury, Jean
    Charpiat, Guillaume
    Jay, Flora
    MOLECULAR ECOLOGY RESOURCES, 2021, 21 (08) : 2645 - 2660
  • [39] A BAYESIAN APPROACH FOR EXTREME LEARNING MACHINE-BASED SUBSPACE LEARNING
    Iosifidis, Alexandros
    Gabbouj, Moncef
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2356 - 2360
  • [40] A distributed learning algorithm for Bayesian inference networks
    Lam, W
    Segre, AM
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 93 - 105