A RELATIONSHIP BETWEEN CROSS-VALIDATION AND VAPNIK BOUNDS ON GENERALIZATION OF LEARNING MACHINES

被引:0
|
作者
Klesk, Przemyslaw [1 ]
机构
[1] Westpomeranian Univ Technol, Dept Methods Artificial Intelligence & Appl Math, Ul Zolnierska 49, Szczecin, Poland
关键词
Statistical learning theory; Bounds on generalization; Cross-validation; Empirical risk minimization; Structural risk minimization; Vapnik-Chervonenkis dimension; ERROR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typically, the n-fold cross-validation is used both to: (1) estimate the generalization properties of a model of fixed complexity, (2) choose from a family of models of different complexities, the one with the best complexity, given a data set of certain size. Obviously, it is a time-consuming procedure. A different approach - the Structural Risk Minimization is based on generalization bounds of learning machines given by Vapnik (Vapnik, 1995a; Vapnik, 1995b). Roughly speaking, SRM is O(n) times faster than n-fold cross-validation but less accurate. We state and prove theorems, which show the probabilistic relationship between the two approaches. In particular, we show what epsilon-difference between the two, one may expect without actually performing the cross validation. We conclude the paper with results of experiments confronting the probabilistic bounds we derived.
引用
收藏
页码:5 / 17
页数:13
相关论文
共 50 条
  • [41] Exact Cross-Validation for kNN : application to passive and active learning in classification
    Celisse, Alain
    Mary-Huard, Tristan
    JOURNAL OF THE SFDS, 2011, 152 (03): : 83 - 97
  • [42] Algorithmic stability and sanity-check bounds for leave-one-out cross-validation
    Kearns, M
    Ron, D
    NEURAL COMPUTATION, 1999, 11 (06) : 1427 - 1453
  • [43] A cross-validation study on the relationship between central D2 receptor occupancy and serum perphenazine concentration
    Mirjam Talvik
    Anna-Lena Nordström
    Niels-Erik Larsen
    Aurelija Jucaite
    Simon Červenka
    Christer Halldin
    Lars Farde
    Psychopharmacology, 2004, 175 : 148 - 153
  • [44] A cross-validation study on the relationship between central D2 receptor occupancy and serum perphenazine concentration
    Talvik, M
    Nordström, AL
    Larsen, NE
    Jucaite, A
    Cervenka, S
    Halldin, C
    Farde, L
    PSYCHOPHARMACOLOGY, 2004, 175 (02) : 148 - 153
  • [45] The relationship between initial threshold, learning, and generalization in perceptual learning
    Lengyel, Gabor
    Fiser, Jozsef
    JOURNAL OF VISION, 2019, 19 (04):
  • [46] Comparisons between Three Cross-Validation Methods for Measuring Learners' Performances
    Cernezel, Ales
    Rozman, Ivan
    Brumen, Bostjan
    INFORMATION MODELLING AND KNOWLEDGE BASES XXVI, 2014, 272 : 77 - 87
  • [47] The connection between cross-validation and Akaike information criterion in a semiparametric family
    Peng, Heng
    Yan, Hongjia
    Zhang, Wenyang
    JOURNAL OF NONPARAMETRIC STATISTICS, 2013, 25 (02) : 475 - 485
  • [49] Honest leave-one-out cross-validation for estimating post-tuning generalization error
    Wang, Boxiang
    Zou, Hui
    STAT, 2021, 10 (01):
  • [50] Methodological Issues in Evaluating Machine Learning Models for EEG Seizure Prediction: Good Cross-Validation Accuracy Does Not Guarantee Generalization to New Patients
    Shafiezadeh, Sina
    Duma, Gian Marco
    Mento, Giovanni
    Danieli, Alberto
    Antoniazzi, Lisa
    Cristaldi, Fiorella Del Popolo
    Bonanni, Paolo
    Testolin, Alberto
    APPLIED SCIENCES-BASEL, 2023, 13 (07):