No unbiased estimator of the variance of K-fold cross-validation

被引:0
|
作者
Bengio, Y [1 ]
Grandvalet, Y [1 ]
机构
[1] Univ Montreal, Dept IRO, Montreal, PQ H3C 3J7, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most machine learning researchers perform quantitative experiments to estimate generalization error and compare algorithm performances. In order to draw statistically convincing conclusions, it is important to estimate the uncertainty of such estimates. This paper studies the estimation of uncertainty around the K-fold cross-validation estimator. The main theorem shows that there exists no universal unbiased estimator of the variance of K-fold cross-validation. An analysis based on the eigendecomposition of the covariance matrix of errors helps to better understand the nature of the problem and shows that naive estimators may grossly underestimate variance, as conpoundrmed by numerical experiments.
引用
收藏
页码:513 / 520
页数:8
相关论文
共 50 条
  • [31] Study on the Impact of Partition-Induced Dataset Shift on k-fold Cross-Validation
    Garcia Moreno-Torres, Jose
    Saez, Jose A.
    Herrera, Francisco
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (08) : 1304 - 1312
  • [32] Limitations on Low Variance k-Fold Cross Validation in Learning Set of Rules Inducers
    Vasinek, Michal
    Platos, Jan
    Snasel, Vaclav
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS), 2016, : 207 - 214
  • [33] Displacement prediction of landslide based on generalized regression neural networks with K-fold cross-validation
    Jiang, Ping
    Chen, Jiejie
    NEUROCOMPUTING, 2016, 198 : 40 - 47
  • [34] Evaluating the Performance of the K-fold Cross-Validation Approach for Model Selection in Growth Mixture Modeling
    He, Jinbo
    Fan, Xitao
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2019, 26 (01) : 66 - 79
  • [35] Uncertain Johnson-Schumacher growthmodel with imprecise observations and k-fold cross-validation test
    Fang, Liang
    Liu, Shiqin
    Huang, Zhiyong
    SOFT COMPUTING, 2020, 24 (04) : 2715 - 2720
  • [36] Uncertain Johnson–Schumacher growth model with imprecise observations and k-fold cross-validation test
    Liang Fang
    Shiqin Liu
    Zhiyong Huang
    Soft Computing, 2020, 24 : 2715 - 2720
  • [37] K-Fold Cross-Validation through Identification of the Opinion Classification Algorithm for the Satisfaction of University Students
    Chamorro-Atalaya, Omar
    Arevalo-Tuesta, Jose
    Balarezo-Mares, Denisse
    Gonzales-Pacheco, Anthony
    Mendoza-Leon, Olga
    Tomas-Quispe, Gregorio
    Suarez-Bazalar, Raul
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (11) : 140 - 158
  • [38] Efficient approximate k-fold and leave-one-out cross-validation for ridge regression
    Meijer, Rosa J.
    Goeman, Jelle J.
    BIOMETRICAL JOURNAL, 2013, 55 (02) : 141 - 155
  • [39] Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification
    Yadav, Sanjay
    Shukla, Sanyam
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 78 - 83
  • [40] On the bias of K-fold cross validation with stable learners
    Aghbalou, Anass
    Portier, Francois
    Sabourin, Anne
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206