How good are the Bayesian information criterion and the minimum description length principle for model selection? A Bayesian network analysis

Cited by: 0
Authors
Cruz-Ramirez, Nicandro [1]
Acosta-Mesa, Hector-Gabriel [1]
Barrientos-Martinez, Rocio-Erandi [1]
Nava-Fernandez, Luis-Alonso [2]
Affiliations
[1] Univ Veracruzana, Fac Fis & Inteligencia Artificial, Sebastian Camacho 5, Col Ctr, Xalapa 91000, Veracruz, Mexico
[2] Univ Veracruzana, Inst Res Educ, Xalapa 91000, Veracruz, Mexico
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Bayesian Information Criterion (BIC) and the Minimum Description Length (MDL) principle have been widely proposed as good metrics for model selection. Both scores consist essentially of two terms: one for accuracy and one for complexity. Their underlying philosophy is to find a model that properly balances these two terms. Surprisingly, however, both metrics often do not work well in practice because they overfit the data. In this paper, we analyze the BIC and MDL scores within the framework of Bayesian networks and present evidence that supports this claim. To this end, we carry out several tests, including the recovery of gold-standard network structures and the construction and evaluation of Bayesian network classifiers. Finally, based on these results, we discuss the disadvantages of both metrics and propose future work to examine these limitations more deeply.
Pages: 494 / +
Page count: 3