The bootstrap: A technique for data-driven statistics. Using computer-intensive analyses to explore experimental data

被引:203
|
作者
Henderson, AR [1 ]
机构
[1] Univ Western Ontario, Dept Biochem, London, ON N6A 5C1, Canada
关键词
bootstrap; computer-intensive methods; jackknife; non-parametric statistics; permutation tests; random number generation;
D O I
10.1016/j.cccn.2005.04.002
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
Background: The concept of resampling data - more commonly referred to as bootstrapping - has been in use for more than three decades. Bootstrapping has considerable theoretical advantages when it is applied to non-Gaussian data. Most of the published literature is concerned with the mathematical aspects of the bootstrap but increasingly this technique is being utilized in medical and other fields. Methods: I reviewed the published literature following a 1994 publication assessing the transfer of technology, including the bootstrap, to the biomedical literature. Results: In the ten-year period following that 1994 paper there were 1679 published references to the technique in Medline. In that same time period the following citations were found in the four major medical journals-British Medical Journal (48), JAMA (51), Lancet (52) and the New England Journal of Medicine (45). Content: I introduce the basic theory of the bootstrap, the jackknife, and permutation tests. The bootstrap is used to estimate the accuracy of an estimator such as the standard error, a confidence interval, or the bias of an estimator. The technique may be useful for analysing smallish expensive-to-collect data sets where prior information is sparse, distributional assumptions are unclear, and where further data may be difficult to acquire. Some of the elementary uses of bootstrapping are illustrated by considering the calculation of confidence intervals such as for reference ranges or for experimental data findings, hypothesis testing such as comparing experimental findings, linear regression, and correlation when studying association and prediction of variables, non-linear regression such as used in immunoassay techniques, and ROC curve processing. Conclusions: These techniques can supplement current nonparametric statistical methods and should be included, where appropriate, in the armamentarium of data processing methodologies. (c) 2005 Elsevier B.V All rights reserved.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [21] Design and experimental evaluation of a data-driven PID controller using cerebellar memory
    Li, Zhifeng
    Hiraoka, Kei
    Yamamoto, Toru
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (11): : 1371 - 1382
  • [22] USING DATA-DRIVEN EEG AND ERP COMPONENTS TO EXPLORE EEG-ERP DYNAMICS: A FEASIBILITY STUDY
    Barry, Robert
    De Blasio, Frances
    PSYCHOPHYSIOLOGY, 2017, 54 : S68 - S68
  • [23] Experimental Evaluation of a Data-Driven Control System using an Electronic Thermal Regulator
    Okubo, Yuka
    Ashida, Yoichiro
    Kinoshita, Takuya
    Yamamoto, Toru
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2018, 5 (02): : 89 - 92
  • [24] Data-Driven Modeling and Experimental Validation of Autonomous Vehicles using Koopman Operator
    Joglekar, Ajinkya
    Sutavani, Sarang
    Samak, Chinmay
    Samak, Tanmay
    Kosaraju, Krishna Chaitanya
    Smereka, Jonathon
    Gorsich, David
    Vaidya, Umesh
    Krovi, Venkat
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9442 - 9447
  • [25] Data-Driven Bifurcation Analysis of Experimental Aeroelastic Systems Using Preflutter Measurements
    Perez, Jesus Garcia
    Ghadami, Amin
    Sanches, Leonardo
    Epureanu, Bogdan I.
    Michon, Guilhem
    AIAA JOURNAL, 2024, 62 (05) : 1906 - 1914
  • [26] A data-driven methodology for bridge indirect health monitoring using unsupervised computer vision
    Hurtado, A. Calderon
    Alamdari, M. Makki
    Atroshchenko, E.
    Chang, K. C.
    Kim, C. W.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 210
  • [27] MANIPULATOR CONTROL USING A DATA-DRIVEN MULTI-PROCESSOR COMPUTER SYSTEM.
    Egan, G.K.
    Richardson, C.P.
    Transactions of the Institution of Engineers, Australia. Mechanical engineering, 1985, ME 10 (03): : 218 - 222
  • [28] Computer simulations of lung airway structures using data-driven surface modeling techniques
    Spencer, RM
    Schroeter, JD
    Martonen, TB
    COMPUTERS IN BIOLOGY AND MEDICINE, 2001, 31 (06) : 499 - 511
  • [29] A Data-Driven Inductor Modeling Technique Using Parametric Circuit Simulation and Deep Learning
    Motomatsu, Takehiro
    Koga, Takahiro
    Shigei, Noritaka
    Yamaguchi, Masahiro
    Itagaki, Atsushi
    Ishizuka, Yoichi
    IEEE TRANSACTIONS ON MAGNETICS, 2023, 59 (11)
  • [30] Practical considerations for experimental designs of spatially autocorrelated data using computer intensive methods
    Hoef, Jay M. Ver
    STATISTICAL METHODOLOGY, 2012, 9 (1-2) : 172 - 184