Bayesian Model Selection, the Marginal Likelihood, and Generalization

Citations: 0
Authors
Lotfi, Sanae [1]
Izmailov, Pavel [1]
Benton, Gregory [1]
Goldblum, Micah [1]
Wilson, Andrew Gordon [1]
Affiliations
[1] NYU, New York, NY 10003 USA
Keywords
CHOICE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
How do we compare between hypotheses that are entirely consistent with observations? The marginal likelihood (aka Bayesian evidence), which represents the probability of generating our observations from a prior, provides a distinctive approach to this foundational question, automatically encoding Occam's razor. Although it has been observed that the marginal likelihood can overfit and is sensitive to prior assumptions, its limitations for hyperparameter learning and discrete model comparison have not been thoroughly investigated. We first revisit the appealing properties of the marginal likelihood for learning constraints and hypothesis testing. We then highlight the conceptual and practical issues in using the marginal likelihood as a proxy for generalization. Namely, we show how marginal likelihood can be negatively correlated with generalization, with implications for neural architecture search, and can lead to both underfitting and overfitting in hyperparameter learning. We provide a partial remedy through a conditional marginal likelihood, which we show is more aligned with generalization, and practically valuable for large-scale hyperparameter learning, such as in deep kernel learning.
Pages: 25
Related papers
50 in total
  • [1] Improving Marginal Likelihood Estimation for Bayesian Phylogenetic Model Selection
    Xie, Wangang
    Lewis, Paul O.
    Fan, Yu
    Kuo, Lynn
    Chen, Ming-Hui
    SYSTEMATIC BIOLOGY, 2011, 60 (02) : 150 - 160
  • [2] Bayesian Allocation Model: Marginal Likelihood-Based Model Selection for Count Tensors
    Yıldırım, Sinan
    Kurutmaz, M. Burak
    Barsbey, Melih
    Simsekli, Umut
    Cemgil, A. Taylan
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (03) : 560 - 573
  • [3] Bayesian model selection for structural damage identification: comparative analysis of marginal likelihood estimators
    Castello, Daniel Alves
    de Sousa, Luiza Freire Cesar
    da Silva, Gabriel Lucas Sousa
    Machado, Marcela Rodrigues
    JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2024, 46 (08)
  • [4] Model selection by pathwise marginal likelihood thresholding
    Di Caterina, Claudia
    Ferrari, Davide
    STATISTICS & PROBABILITY LETTERS, 2024, 214
  • [5] Predictive likelihood for Bayesian model selection and averaging
    Ando, Tomohiro
    Tsay, Ruey
    INTERNATIONAL JOURNAL OF FORECASTING, 2010, 26 (04) : 744 - 763
  • [6] Marginal Likelihood Based Model Comparison in Fuzzy Bayesian Learning
    Pan, Indranil
    Bester, Dirk
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (06) : 794 - 799
  • [7] Copula, marginal distributions and model selection: a Bayesian note
    Ralph dos Santos Silva
    Hedibert Freitas Lopes
    Statistics and Computing, 2008, 18 : 313 - 320
  • [8] Bayesian model selection for sand with generalization ability evaluation
    Jin, Yin-Fu
    Yin, Zhen-Yu
    Zhou, Wan-Huan
    Shao, Jian-Fu
    INTERNATIONAL JOURNAL FOR NUMERICAL AND ANALYTICAL METHODS IN GEOMECHANICS, 2019, 43 (14) : 2305 - 2327
  • [9] Bayesian marginal model selection for low rank sources
    Radich, BM
    Buckley, KM
    NINTH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1998 : 268 - 271
  • [10] Copula, marginal distributions and model selection: a Bayesian note
    Silva, Ralph dos Santos
    Lopes, Hedibert Freitas
    STATISTICS AND COMPUTING, 2008, 18 (03) : 313 - 320