How good are the Bayesian information criterion and the minimum description length principle for model selection A Bayesian network analysis

被引:0
|
作者
Cruz-Ramirez, Nicandro [1 ]
Acosta-Mesa, Hector-Gabriel [1 ]
Barrientos-Martinez, Rocio-Erandi [1 ]
Nava-Fernandez, Luis-Alonso [2 ]
机构
[1] Univ Veracruzana, Fac Fis & Inteligencia Artificial, Sebastian Camacho 5,Col Ctr, Xalapa 91000, Veracruz, Mexico
[2] Univ Veracruzana, Inst Res Educ, Xalapa 91000, Veracruz, Mexico
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Bayesian Information Criterion (BIC) and the Minimum Description Length Principle (MDL) have been widely proposed as good metrics for model selection. Such scores basically include two terms: one for accuracy and the other for complexity. Their philosophy is to find a model that rightly balances these terms. However, it is surprising that both metrics do often not work very well in practice for they overfit the data. In this paper, we present an analysis of the BIC and MDL scores using the framework of Bayesian networks that supports such a claim. To this end, we carry out different tests that include the recovery of gold-standard network structures as well as the construction and evaluation of Bayesian network classifiers. Finally, based on these results, we discuss the disadvantages of both metrics and propose some future work to examine these limitations more deeply.
引用
收藏
页码:494 / +
页数:3
相关论文
共 50 条
  • [41] Approximating Model Probabilities in Bayesian Information Criterion and Decision-Theoretic Approaches to Model Selection in Phylogenetics
    Evans, Jason
    Sullivan, Jack
    MOLECULAR BIOLOGY AND EVOLUTION, 2011, 28 (01) : 343 - 349
  • [42] Minimum description length model selection in associative learning
    Gallistel, C. Randy
    Wilkes, Jason T.
    CURRENT OPINION IN BEHAVIORAL SCIENCES, 2016, 11 : 8 - 13
  • [43] Model selection via Bayesian information criterion for divide-and-conquer penalized quantile regression
    Kang, Jongkyeong
    Han, Seokwon
    Bang, Sungwan
    KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (02) : 217 - 227
  • [44] Semiparametric Bayesian information criterion for model selection in ultra-high dimensional additive models
    Lian, Heng
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 123 : 304 - 310
  • [45] Heteroschedasticity in survey data and model selection based on weighted Schwarz-bayesian information criterion
    Jayakumar, G. S. David Sam
    Sulthan, A.
    ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2014, 7 (02) : 199 - 217
  • [46] Gene Regulatory Network Inference Using Predictive Minimum Description Length Principle and Conditional Mutual Information
    Chaitankar, Vijender
    Mang, Chaoyang
    Ghosh, Preetam
    Perkins, Edward J.
    Gong, Ping
    Deng, Youping
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 487 - +
  • [47] Bayesian Model Selection in the Analysis of Cointegration
    Wroblewska, Justyna
    CENTRAL EUROPEAN JOURNAL OF ECONOMIC MODELLING AND ECONOMETRICS, 2009, 1 (01): : 57 - 69
  • [48] Bayesian case-deletion model complexity and information criterion
    Zhu, Hongtu
    Ibrahim, Joseph G.
    Chen, Qingxia
    STATISTICS AND ITS INTERFACE, 2014, 7 (04) : 531 - 542
  • [49] Bayesian model evidence as a practical alternative to deviance information criterion
    Pooley, C. M.
    Marion, G.
    ROYAL SOCIETY OPEN SCIENCE, 2018, 5 (03):
  • [50] Bayesian Network based Information Retrieval Model
    Garrouch, Kamel
    Omri, Mohamed Nazih
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 193 - 200