Improving cross-validated bandwidth selection using subsampling-extrapolation techniques

被引:6
|
作者
Wang, Qing [1 ]
Lindsay, Bruce G. [2 ]
机构
[1] Williams Coll, Dept Math & Stat, Williamstown, MA 01267 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Bandwidth selection; Cross-validation; Extrapolation; L-2; distance; Nonparametric kernel density estimator; Subsampling; DENSITY-ESTIMATION; MODEL SELECTION;
D O I
10.1016/j.csda.2015.03.005
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Cross-validation methodologies have been widely used as a means of selecting tuning parameters in nonparametric statistical problems. In this paper we focus on a new method for improving the reliability of cross-validation. We implement this method in the context of the kernel density estimator, where one needs to select the bandwidth parameter so as to minimize L-2 risk. This method is a two-stage subsampling-extrapolation bandwidth selection procedure, which is realized by first evaluating the risk at a fictional sample size m (m <= sample size n) and then extrapolating the optimal bandwidth from m to n. This two-stage method can dramatically reduce the variability of the conventional unbiased cross-validation bandwidth selector. This simple first-order extrapolation estimator is equivalent to the rescaled "bagging-CV" bandwidth selector in Hall and Robinson (2009) if one sets the bootstrap size equal to the fictional sample size. However, our simplified expression for the risk estimator enables us to compute the aggregated risk without any bootstrapping. Furthermore, we developed a second-order extrapolation technique as an extension designed to improve the approximation of the true optimal bandwidth. To select the optimal choice of the Fictional size m given a sample of size n, we propose a nested cross-validation methodology. Based on simulation study, the proposed new methods show promising performance across a wide selection of distributions. In addition, we also investigated the asymptotic properties of the proposed bandwidth selectors. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:51 / 71
页数:21
相关论文
共 50 条
  • [21] DOCUMENTING THE COST-EFFECTIVENESS OF OUTPATIENT SERVICES USING CROSS-VALIDATED NONRANDOMIZED COMPARISON METHODS
    PASTORELLO, T
    GERONTOLOGIST, 1983, 23 : 74 - 74
  • [22] How to avoid mismodelling in GLM-based fMRI data analysis: cross-validated Bayesian model selection
    Soch, Joram
    Haynes, John-Dylan
    Allefeld, Carsten
    NEUROIMAGE, 2016, 141 : 469 - 489
  • [23] Early detection of chronic kidney disease using recursive feature elimination and cross-validated XGBoost model
    Kumar, Mukesh
    INTERNATIONAL JOURNAL OF COMPUTATIONAL MATERIALS SCIENCE AND ENGINEERING, 2023,
  • [24] Early detection of chronic kidney disease using recursive feature elimination and cross-validated XGBoost model
    Kumar, Mukesh
    INTERNATIONAL JOURNAL OF COMPUTATIONAL MATERIALS SCIENCE AND ENGINEERING, 2023,
  • [25] Individualized, cross-validated prediction of future dementia using cognitive assessments in people with mild cognitive symptoms
    Borland, Emma
    Mattson-Carlgren, Niklas
    Tideman, Pontus
    Stomrud, Erik
    Hansson, Oskar
    Palmqvist, Sebastian
    ALZHEIMERS & DEMENTIA, 2024,
  • [26] Exploring individual and group differences in latent brain networks using cross-validated simultaneous component analysis
    Helwig, Nathaniel E.
    Snodgress, Matthew A.
    NEUROIMAGE, 2019, 201
  • [27] Estimation of Heat-Attributable Mortality Using the Cross-Validated Best Temperature Metric in Switzerland and South Korea
    Lee, Jae Young
    Roosli, Martin
    Ragettli, Martina S.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (12)
  • [28] Estimating badger social-group abundance in the Republic of Ireland using cross-validated species distribution modelling
    Byrne, Andrew W.
    Acevedo, Pelayo
    Green, Stuart
    O'Keeffe, James
    ECOLOGICAL INDICATORS, 2014, 43 : 94 - 102
  • [29] CROSS-VALIDATED R(2)-GUIDED REGION SELECTION FOR COMPARATIVE MOLECULAR-FIELD ANALYSIS - A SIMPLE METHOD TO ACHIEVE CONSISTENT RESULTS
    CHO, SJ
    TROPSHA, A
    JOURNAL OF MEDICINAL CHEMISTRY, 1995, 38 (07) : 1060 - 1066
  • [30] AN OPTIMIZED COMFA STUDY, WITH CROSS-VALIDATED R2-GUIDED REGION SELECTION, OF HIV-1 INTEGRASE INHIBITORS USING MM3 (94)
    RAMANATHAN, CS
    TAYLOR, EW
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1995, 210 : 171 - MEDI