Subspace Inference for Bayesian Deep Learning

被引:0
|
作者
Izmailov, Pavel [1 ]
Maddox, Wesley J. [1 ]
Kirichenko, Polina [1 ]
Garipov, Timur [4 ]
Vetrov, Dmitry [2 ,3 ]
Wilson, Andrew Gordon [1 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Higher Sch Econ, Moscow, Russia
[3] Samsung HSE Lab, Moscow, Russia
[4] Samsung AI Ctr Moscow, Moscow, Russia
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian inference was once a gold standard for learning with neural networks, providing accurate full predictive distributions and well calibrated uncertainty. However, scaling Bayesian inference techniques to deep neural networks is challenging due to the high dimensionality of the parameter space. In this paper, we construct low-dimensional subspaces of parameter space, such as the first principal components of the stochastic gradient descent (SGD) trajectory, which contain diverse sets of high performing models. In these subspaces, we are able to apply elliptical slice sampling and variational inference, which struggle in the full parameter space. We show that Bayesian model averaging over the induced posterior in these subspaces produces accurate predictions and well-calibrated predictive uncertainty for both regression and image classification.
引用
收藏
页码:1169 / 1179
页数:11
相关论文
共 50 条
  • [41] Learning Bayesian networks with low inference complexity
    Benjumeda M.
    Larrañaga P.
    Bielza C.
    Progress in Artificial Intelligence, 2016, 5 (1) : 15 - 26
  • [42] Learning of Laser Dynamics using Bayesian Inference
    Zibar, Darko
    Schaeffer, Christian
    Mork, Jesper
    2018 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2018,
  • [43] Learning summary statistics for Bayesian inference with Autoencoders
    Albert, Carlo
    Ulzega, Simone
    Perez-Cruz, Fernando
    Ozdemir, Firat
    Mira, Antonietta
    SCIPOST PHYSICS CORE, 2022, 5 (03):
  • [44] Learning Fast-Inference Bayesian Networks
    Ramaswamy, Vaidyanathan Peruvemba
    Szeider, Stefan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [45] Online reinforcement learning control by Bayesian inference
    Xia, Zhongpu
    Zhao, Dongbin
    IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (12): : 1331 - 1338
  • [46] Deep Bayesian Multimedia Learning
    Chien, Jen-Tzung
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4791 - 4793
  • [47] A Survey on Bayesian Deep Learning
    Wang, Hao
    Yeung, Dit-Yan
    ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [48] Bayesian Compression for Deep Learning
    Louizos, Christos
    Ullrich, Karen
    Welling, Max
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [49] Deep Learning: A Bayesian Perspective
    Polson, Nicholas G.
    Sokolov, Vadim
    BAYESIAN ANALYSIS, 2017, 12 (04): : 1275 - 1304
  • [50] Deep Learning and Bayesian Methods
    Prosper, Harrison B.
    XIITH QUARK CONFINEMENT AND THE HADRON SPECTRUM, 2017, 137