Bayesian Model Selection, the Marginal Likelihood, and Generalization

Cited by: 0
Authors
Lotfi, Sanae [1]
Izmailov, Pavel [1]
Benton, Gregory [1]
Goldblum, Micah [1]
Wilson, Andrew Gordon [1]
Affiliations
[1] NYU, New York, NY 10003 USA
Keywords
CHOICE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
How do we compare between hypotheses that are entirely consistent with observations? The marginal likelihood (aka Bayesian evidence), which represents the probability of generating our observations from a prior, provides a distinctive approach to this foundational question, automatically encoding Occam's razor. Although it has been observed that the marginal likelihood can overfit and is sensitive to prior assumptions, its limitations for hyperparameter learning and discrete model comparison have not been thoroughly investigated. We first revisit the appealing properties of the marginal likelihood for learning constraints and hypothesis testing. We then highlight the conceptual and practical issues in using the marginal likelihood as a proxy for generalization. Namely, we show how marginal likelihood can be negatively correlated with generalization, with implications for neural architecture search, and can lead to both underfitting and overfitting in hyperparameter learning. We provide a partial remedy through a conditional marginal likelihood, which we show is more aligned with generalization, and practically valuable for large-scale hyperparameter learning, such as in deep kernel learning.
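For reference, the quantities named in the abstract can be written out as below. This is only a sketch in standard Bayesian notation, which is an assumption here because the abstract defines no symbols: $\mathcal{D} = \{\mathcal{D}_1, \dots, \mathcal{D}_n\}$ denotes the observations, $\mathcal{M}$ the model, $w$ its parameters, and $m$ a hypothetical cut-off index separating the data that is conditioned on from the data that is scored.

```latex
% Marginal likelihood: probability of the observations under the prior.
p(\mathcal{D} \mid \mathcal{M})
  = \int p(\mathcal{D} \mid w, \mathcal{M}) \, p(w \mid \mathcal{M}) \, dw

% Chain-rule decomposition over data points (holds for any ordering).
\log p(\mathcal{D} \mid \mathcal{M})
  = \sum_{i=1}^{n} \log p(\mathcal{D}_i \mid \mathcal{D}_{<i}, \mathcal{M})

% Conditional marginal likelihood: keep only the terms that are
% conditioned on at least the first m-1 observations.
\log p(\mathcal{D}_{\ge m} \mid \mathcal{D}_{<m}, \mathcal{M})
  = \sum_{i=m}^{n} \log p(\mathcal{D}_i \mid \mathcal{D}_{<i}, \mathcal{M})
```

Read this way, the early terms of the full sum score predictions made from the prior alone, which is one way to see the sensitivity to prior assumptions that the abstract mentions; the conditional form omits those terms.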
Pages: 25