An Improved Cross-Validated Adversarial Validation Method

被引:0
|
作者
Zhang, Wen [1 ]
Liu, Zhengjiang [1 ]
Xue, Yan [2 ]
Wang, Ruibo [3 ]
Cao, Xuefei [1 ]
Li, Jihong [3 ]
机构
[1] Shanxi Univ, Sch Automat & Software Engn, Taiyuan 030006, Peoples R China
[2] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
[3] Shanxi Univ, Sch Modern Educ Technol, Taiyuan 030006, Peoples R China
关键词
Adversarial Validation; Cross Validation; Algorithm Comparison; Significance Testing; Distribution Shift; DATASET SHIFT; TESTS;
D O I
10.1007/978-3-031-40283-8_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a widely-used strategy among Kaggle competitors, adversarial validation provides a novel selection framework of a reasonable training and validation sets. An adversarial validation heavily depends on an accurate identification of the difference between the distributions of the training and test sets released in a Kaggle competition. However, the typical adversarial validation merely uses a K-fold cross-validated point estimator to measure the difference regardless of the variation of the estimator. Therefore, the typical adversarial validation tends to produce unpromising false positive conclusions. In this study, we reconsider the adversarial validation from a perspective of algorithm comparison. Specifically, we formulate the adversarial validation into a comparison task of a well-trained classifier with a random-guessing classifier on an adversarial data set. Then, we investigate the state-of-the-art algorithm comparison methods to improve the adversarial validation method for reducing false positive conclusions. We conducted sufficient simulated and real-world experiments, and we showed the recently-proposed 5 x 2 BCV McNemar's test can significantly improve the performance of the adversarial validation method.
引用
收藏
页码:343 / 353
页数:11
相关论文
共 50 条
  • [41] BAYESIAN CONFIDENCE-INTERVALS FOR THE CROSS-VALIDATED SMOOTHING SPLINE
    WAHBA, G
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1983, 45 (01): : 133 - 150
  • [42] Bias of exploratory and cross-validated DETECT index under uniclimensionality
    Monahan, Patrick O.
    Stump, Timothy E.
    Finch, Holmes
    Hambleton, Ronald K.
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2007, 31 (06) : 483 - 503
  • [44] A cross-validated cytoarchitectonic atlas of the human ventral visual stream
    Rosenke, Mona
    Weiner, Kevin S.
    Barnett, Michael A.
    Zilles, Karl
    Amunts, Katrin
    Goebel, Rainer
    Grill-Spector, Kalanit
    NEUROIMAGE, 2018, 170 : 257 - 270
  • [45] Influence Diagnostics for the Cross-Validated Smoothing Parameter in Kernel Smoothing
    JIANG Jiancheng(Department of Probability and Statistics
    Journal of Systems Science and Systems Engineering, 1996, (04) : 385 - 390
  • [46] Cross-validated estimations in the single-functional index model
    Ait-Saidi, Ahmed
    Ferraty, Frederic
    Kassa, Rabah
    Vieu, Philippe
    STATISTICS, 2008, 42 (06) : 475 - 494
  • [47] A new symptom model for autism cross-validated in an independent sample
    Boomsma, A.
    Van Lang, N. D. J.
    De Jonge, M. V.
    De Bildt, A. A.
    Van Engeland, H.
    Minderaa, R. B.
    JOURNAL OF CHILD PSYCHOLOGY AND PSYCHIATRY, 2008, 49 (08) : 809 - 816
  • [48] ELECTRODERMAL INDICANTS OF AROUSAL IN BRAIN DAMAGE - CROSS-VALIDATED FINDINGS
    PARSONS, OA
    CHANDLER, PJ
    PSYCHOPHYSIOLOGY, 1969, 5 (06) : 644 - &
  • [49] Miscellanea Bagging cross-validated bandwidths with application to big data
    Barreiro-Ures, D.
    Cao, R.
    Francisco-Fernandez, M.
    Hart, J. D.
    BIOMETRIKA, 2021, 108 (04) : 981 - 988
  • [50] Model selection for probabilistic clustering using cross-validated likelihood
    Padhraic Smyth
    Statistics and Computing, 2000, 10 : 63 - 72