A Simple Baseline for Bayesian Uncertainty in Deep Learning

被引:0
|
作者
Maddox, Wesley J. [1 ]
Garipov, Timur [2 ]
Izmailov, Pavel [1 ]
Vetrov, Dmitry [2 ,3 ]
Wilson, Andrew Gordon [1 ]
机构
[1] NYU, New York, NY 10003 USA
[2] Samsung AI Ctr Moscow, Moscow, Russia
[3] Natl Res Univ Higher Sch Econ, Samsung HSE Lab, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose SWA-Gaussian (SWAG), a simple, scalable, and general purpose approach for uncertainty representation and calibration in deep learning. Stochastic Weight Averaging (SWA), which computes the first moment of stochastic gradient descent (SGD) iterates with a modified learning rate schedule, has recently been shown to improve generalization in deep learning. With SWAG, we fit a Gaussian using the SWA solution as the first moment and a low rank plus diagonal covariance also derived from the SGD iterates, forming an approximate posterior distribution over neural network weights; we then sample from this Gaussian distribution to perform Bayesian model averaging. We empirically find that SWAG approximates the shape of the true posterior, in accordance with results describing the stationary distribution of SGD iterates. Moreover, we demonstrate that SWAG performs well on a wide variety of tasks, including out of sample detection, calibration, and transfer learning, in comparison to many popular alternatives including MC dropout, KFAC Laplace, SGLD, and temperature scaling.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Deep Deterministic Uncertainty: A New Simple Baseline
    Mukhoti, Jishnu
    Kirsch, Andreas
    van Amersfoort, Joost
    Torr, Philip H. S.
    Gal, Yarin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24384 - 24394
  • [2] Uncertainty in Bayesian deep label distribution learning
    Zheng, Rui
    Zhang, Shulin
    Liu, Lei
    Luo, Yuhao
    Sun, Mingzhai
    [J]. APPLIED SOFT COMPUTING, 2021, 101
  • [3] PCANet: A Simple Deep Learning Baseline for Image Classification?
    Chan, Tsung-Han
    Jia, Kui
    Gao, Shenghua
    Lu, Jiwen
    Zeng, Zinan
    Ma, Yi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5017 - 5032
  • [4] Bayesian Deep Learning for Hyperspectral Image Classification With Low Uncertainty
    He, Xin
    Chen, Yushi
    Huang, Lingbo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Uncertainty utilization in fault detection using Bayesian deep learning
    Maged, Ahmed
    Xie, Min
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2022, 64 : 316 - 329
  • [6] Bayesian Uncertainty Estimation for Deep Learning Inversion of Electromagnetic Data
    Oh, Seokmin
    Byun, Joongmoo
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
    Gal, Yarin
    Ghahramani, Zoubin
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [8] Decomposition of Uncertainty in Bayesian Deep Learning for Efficient and Risk-sensitive Learning
    Depeweg, Stefan
    Hernandez-Lobato, Jose Miguel
    Doshi-Velez, Finale
    Udluft, Steffen
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] A Bayesian deep learning method for freeway incident detection with uncertainty quantification
    Liu, Genwang
    Jin, Haolin
    Li, Jiaze
    Hu, Xianbiao
    Li, Jian
    [J]. ACCIDENT ANALYSIS AND PREVENTION, 2022, 176
  • [10] Modeling uncertainty to improve personalized recommendations via Bayesian deep learning
    Xin Wang
    Serdar Kadıoğlu
    [J]. International Journal of Data Science and Analytics, 2023, 16 : 191 - 201