Data augmentation in Bayesian neural networks and the cold posterior effect

被引:0
|
作者
Nabarro, Seth [1 ]
Ganev, Stoil [2 ]
Garriga-Alonso, Adria [3 ]
Fortuin, Vincent [3 ,4 ]
Van der Wilk, Mark [1 ]
Aitchison, Laurence [2 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] Univ Bristol, Dept Comp Sci, Bristol, Avon, England
[3] Univ Cambridge, Dept Engn, Cambridge, England
[4] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bayesian neural networks that incorporate data augmentation implicitly use a "randomly perturbed log-likelihood [which] does not have a clean interpretation as a valid likelihood function" (Izmailov et al. 2021). Here, we provide several approaches to developing principled Bayesian neural networks incorporating data augmentation. We introduce a "finite orbit" setting which allows valid likelihoods to be computed exactly, and for the more usual "full orbit" setting we derive multi-sample bounds tighter than those used previously. These models cast light on the origin of the cold posterior effect. In particular, we find that the cold posterior effect persists even in these principled models incorporating data augmentation. This suggests that the cold posterior effect cannot be dismissed as an artifact of data augmentation using incorrect likelihoods.
引用
收藏
页码:1434 / 1444
页数:11
相关论文
共 50 条
  • [1] Posterior Uncertainty Quantification in Neural Networks using Data Augmentation
    Wu, Luhuan
    Williamson, Sinead A.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [2] How Tempering Fixes Data Augmentation in Bayesian Neural Networks
    Bachmann, Gregor
    Noci, Lorenzo
    Hofmann, Thomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [3] Disentangling the Roles of Curation, Data-Augmentation and the Prior in the Cold Posterior Effect
    Noci, Lorenzo
    Roth, Kevin
    Bachmann, Gregor
    Nowozin, Sebastian
    Hofmann, Thomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Learning Bayesian Networks with Incomplete Data by Augmentation
    Adel, Tameem
    de Campos, Cassio P.
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1684 - 1690
  • [5] Data Augmentation for Graph Neural Networks
    Zhao, Tong
    Liu, Yozen
    Neves, Leonardo
    Woodford, Oliver
    Jiang, Meng
    Shah, Neil
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11015 - 11023
  • [6] Hydranet: Data Augmentation for Regression Neural Networks
    Dubost, Florian
    Bortsova, Gerda
    Adams, Hieab
    Ikram, M. Arfan
    Niessen, Wiro
    Vernooij, Meike
    de Bruijne, Marleen
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT IV, 2019, 11767 : 438 - 446
  • [7] Rationalizing Graph Neural Networks with Data Augmentation
    Liu, Gang
    Inae, Eric
    Luo, Tengfei
    Jiang, Meng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [8] Deep Learning Neural Networks and Bayesian Neural Networks in Data Analysis
    Chernoded, Andrey
    Dudko, Lev
    Myagkov, Igor
    Volkov, Petr
    XXIII INTERNATIONAL WORKSHOP HIGH ENERGY PHYSICS AND QUANTUM FIELD THEORY (QFTHEP 2017), 2017, 158
  • [9] Encoding the Latent Posterior of Bayesian Neural Networks for Uncertainty Quantification
    Franchi, Gianni
    Bursuc, Andrei
    Aldea, Emanuel
    Dubuisson, Severine
    Bloch, Isabelle
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2027 - 2040
  • [10] Generalized Bayesian Posterior Expectation Distillation for Deep Neural Networks
    Vadera, Meet P.
    Jalain, Brian
    Marlin, Benjamin M.
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 719 - 728