The bootstrap: A technique for data-driven statistics. Using computer-intensive analyses to explore experimental data

被引:203
|
作者
Henderson, AR [1 ]
机构
[1] Univ Western Ontario, Dept Biochem, London, ON N6A 5C1, Canada
关键词
bootstrap; computer-intensive methods; jackknife; non-parametric statistics; permutation tests; random number generation;
D O I
10.1016/j.cccn.2005.04.002
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Background: The concept of resampling data - more commonly referred to as bootstrapping - has been in use for more than three decades. Bootstrapping has considerable theoretical advantages when it is applied to non-Gaussian data. Most of the published literature is concerned with the mathematical aspects of the bootstrap but increasingly this technique is being utilized in medical and other fields. Methods: I reviewed the published literature following a 1994 publication assessing the transfer of technology, including the bootstrap, to the biomedical literature. Results: In the ten-year period following that 1994 paper there were 1679 published references to the technique in Medline. In that same time period the following citations were found in the four major medical journals-British Medical Journal (48), JAMA (51), Lancet (52) and the New England Journal of Medicine (45). Content: I introduce the basic theory of the bootstrap, the jackknife, and permutation tests. The bootstrap is used to estimate the accuracy of an estimator such as the standard error, a confidence interval, or the bias of an estimator. The technique may be useful for analysing smallish expensive-to-collect data sets where prior information is sparse, distributional assumptions are unclear, and where further data may be difficult to acquire. Some of the elementary uses of bootstrapping are illustrated by considering the calculation of confidence intervals such as for reference ranges or for experimental data findings, hypothesis testing such as comparing experimental findings, linear regression, and correlation when studying association and prediction of variables, non-linear regression such as used in immunoassay techniques, and ROC curve processing. Conclusions: These techniques can supplement current nonparametric statistical methods and should be included, where appropriate, in the armamentarium of data processing methodologies. (c) 2005 Elsevier B.V All rights reserved.
引用
下载
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [1] Data-driven guidelines for phylogenomic analyses using SNP data
    Suissa, Jacob S.
    de la Cerda, Gisel Y.
    Graber, Leland C.
    Jelley, Chloe
    Wickell, David
    Phillips, Heather R.
    Grinage, Ayress D.
    Moreau, Corrie S.
    Specht, Chelsea D.
    Doyle, Jeff J.
    Landis, Jacob B.
    APPLICATIONS IN PLANT SCIENCES, 2024,
  • [2] Using data-driven methods to explore the predictability of surface soil moisture with FLUXNET site data
    Pan, Jinjing
    Wei Shangguan
    Li, Lu
    Yuan, Hua
    Zhang, Shupeng
    Lu, Xinjie
    Wei, Nan
    Dai, Yongjiu
    HYDROLOGICAL PROCESSES, 2019, 33 (23) : 2978 - 2996
  • [3] Technique for Data-Driven Mining in Physiological Sensor Data by Using Eclat Algorithm
    Kalbhor, Shraddha
    Kedar, S., V
    EMERGING TECHNOLOGIES IN DATA MINING AND INFORMATION SECURITY, IEMIS 2018, VOL 1, 2019, 755 : 419 - 427
  • [4] Advances in data-driven analyses and modelling using EPR-MOGA
    Giustolisi, O.
    Savic, D. A.
    JOURNAL OF HYDROINFORMATICS, 2009, 11 (3-4) : 225 - 236
  • [5] Data-driven sparse reconstruction of flow over a stalled aerofoil using experimental data
    Carter, Douglas W.
    De Voogt, Francis
    Soares, Renan
    Ganapathisubramani, Bharathram
    DATA-CENTRIC ENGINEERING, 2021, 2 (04):
  • [6] Data-Driven Tuning of State Feedback Gains with Stability Constraint Using Experimental Data
    Aoki, Shogo
    Yubai, Kazuhiro
    Yashiro, Daisuke
    Komada, Satoshi
    2016 INTERNATIONAL AUTOMATIC CONTROL CONFERENCE (CACS), 2016, : 126 - 131
  • [7] DATA-DRIVEN SINGLE IMAGE DEPTH ESTIMATION USING WEIGHTED MEDIAN STATISTICS
    Kim, Youngjung
    Choi, Sunghwan
    Sohn, Kwanghoon
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3808 - 3812
  • [8] Recovery of energy losses using an online data-driven optimization technique
    Ashuri, Turaj
    Li, Yaoyu
    Hosseini, Seyed Ehsan
    ENERGY CONVERSION AND MANAGEMENT, 2020, 225
  • [9] Polymer extrusion die design using a data-driven autoencoders technique
    Chady Ghnatios
    Eloi Gravot
    Victor Champaney
    Nicolas Verdon
    Nicolas Hascoët
    Francisco Chinesta
    International Journal of Material Forming, 2024, 17
  • [10] Polymer extrusion die design using a data-driven autoencoders technique
    Ghnatios, Chady
    Gravot, Eloi
    Champaney, Victor
    Verdon, Nicolas
    Hascoet, Nicolas
    Chinesta, Francisco
    INTERNATIONAL JOURNAL OF MATERIAL FORMING, 2024, 17 (01)