No unbiased estimator of the variance of K-fold cross-validation

Authors
Bengio, Y. [1]
Grandvalet, Y. [1]
Affiliations
[1] Univ Montreal, Dept IRO, Montreal, PQ H3C 3J7, Canada
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Most machine learning researchers perform quantitative experiments to estimate generalization error and compare algorithm performances. In order to draw statistically convincing conclusions, it is important to estimate the uncertainty of such estimates. This paper studies the estimation of uncertainty around the K-fold cross-validation estimator. The main theorem shows that there exists no universal unbiased estimator of the variance of K-fold cross-validation. An analysis based on the eigendecomposition of the covariance matrix of errors helps to better understand the nature of the problem and shows that naive estimators may grossly underestimate variance, as confirmed by numerical experiments.
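The abstract's point about naive estimators can be explored with a small simulation. The sketch below is not taken from the paper: the learner (a least-squares line), the Gaussian data model, the sample size, the number of folds, and the function names `kfold_cv_errors` and `simulate` are all assumptions chosen for illustration. It compares the empirical variance of the K-fold estimate across many independent datasets with the average of one common naive estimate, the sample variance of the K fold means divided by K, which treats the folds as independent.

```python
import numpy as np


def kfold_cv_errors(x, y, k):
    """Return the k fold-average squared errors of a least-squares line fit."""
    n = len(x)
    folds = np.array_split(np.arange(n), k)
    fold_means = []
    for test_idx in folds:
        train_mask = np.ones(n, dtype=bool)
        train_mask[test_idx] = False
        # Fit on the k-1 training folds, evaluate on the held-out fold.
        slope, intercept = np.polyfit(x[train_mask], y[train_mask], 1)
        pred = slope * x[test_idx] + intercept
        fold_means.append(np.mean((y[test_idx] - pred) ** 2))
    return np.array(fold_means)


def simulate(n=40, k=5, n_datasets=2000, seed=0):
    """Compare the empirical variance of the CV estimate with a naive estimate.

    Assumed data model (illustrative only): y = 2x + Gaussian noise.
    Returns (empirical variance of the CV estimate over datasets,
             mean of the naive per-dataset variance estimate).
    """
    rng = np.random.default_rng(seed)
    cv_estimates = np.empty(n_datasets)
    naive_vars = np.empty(n_datasets)
    for i in range(n_datasets):
        x = rng.normal(size=n)
        y = 2.0 * x + rng.normal(size=n)
        fold_means = kfold_cv_errors(x, y, k)
        cv_estimates[i] = fold_means.mean()
        # Naive estimate: treats the k fold means as independent draws.
        naive_vars[i] = fold_means.var(ddof=1) / k
    return cv_estimates.var(ddof=1), naive_vars.mean()


if __name__ == "__main__":
    empirical, naive = simulate()
    print(f"empirical variance of CV estimate: {empirical:.5f}")
    print(f"mean naive variance estimate:      {naive:.5f}")
```

The gap between the two printed numbers, if any, depends on the assumed setup; the sketch only shows how such a comparison can be made and does not reproduce the paper's experiments.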
Pages: 513-520
Page count: 8