Resampling-based multiple testing for microarray data analysis

被引:231
|
作者
Ge, YC
Dudoit, S
Speed, TP
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Parkville, Vic, Australia
关键词
multiple testing; family-wise error rate; false discovery rate; adjusted p-value; fast algorithm; minP; microarray;
D O I
10.1007/BF02595811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The burgeoning field of genomics has revived interest in multiple testing procedures by raising new methodological and computational challenges. For example, microarray experiments generate large multiplicity problems in which thousands of hypotheses are tested simultaneously. Westfall and Young (1993) propose resampling-based p-value adjustment procedures which are highly relevant to microarray experiments. This article discusses different criteria for error control in resampling-based multiple testing, including (a) the family wise error rate of Westfall and Young (1993) and (b) the false discovery rate developed by Benjamini and Hochberg (1995), both from a frequentist viewpoint; and (c) the positive false discovery rate of Storey (2002a), which has a Bayesian motivation. We also introduce our recently developed fast algorithm for implementing the minP adjustment to control family-wise error rate. Adjusted p-values for different approaches are applied to gene expression data from two recently published microarray studies. The properties of these procedures for multiple testing are compared.
引用
收藏
页码:1 / 77
页数:77
相关论文
共 50 条
  • [31] Resampling-based noise correction for crowdsourcing
    Xu, Wenqiang
    Jiang, Liangxiao
    Li, Chaoqun
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2021, 33 (06) : 985 - 999
  • [32] A resampling-based approach to share reference panels
    Cavinato, Theo
    Rubinacci, Simone
    Malaspinas, Anna-Sapfo
    Delaneau, Olivier
    NATURE COMPUTATIONAL SCIENCE, 2024, 4 (05): : 360 - 366
  • [33] RESAMPLING-BASED ESTIMATOR IN NONLINEAR-REGRESSION
    MONG, J
    WANG, XR
    STATISTICA SINICA, 1994, 4 (01) : 187 - 198
  • [34] Invertible Resampling-based Layered Image Compression
    Xu, Youmin
    Zhang, Jian
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 380 - 380
  • [35] Introduction to Permutation and Resampling-Based Hypothesis Tests
    LaFleur, Bonnie J.
    Greevy, Robert A.
    JOURNAL OF CLINICAL CHILD AND ADOLESCENT PSYCHOLOGY, 2009, 38 (02): : 286 - 294
  • [36] Resampling-Based Analysis of Multivariate Data and Repeated Measures Designs with the R Package MANOVA.RM
    Friedrich, Sarah
    Konietschke, Frank
    Pauly, Markus
    R JOURNAL, 2019, 11 (02): : 380 - 400
  • [37] A resampling-based method to evaluate NLI models
    Salvatore, Felipe de Souza
    Finger, Marcelo
    Hirata Jr, Roberto
    Patriota, Alexandre G.
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (04) : 793 - 820
  • [38] Tight clustering: A resampling-based approach for identifying stable and tight patterns in data
    Tseng, GC
    Wong, WH
    BIOMETRICS, 2005, 61 (01) : 10 - 16
  • [39] Improved gene prediction by resampling-based spectral analysis of DNA sequence
    Chang, C. Q.
    Fung, Peter C. W.
    Hung, Y. S.
    2008 INTERNATIONAL SPECIAL TOPIC CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS IN BIOMEDICINE, VOLS 1 AND 2, 2008, : 391 - +
  • [40] Sample size calculation for multiple testing in microarray data analysis
    Jung, SH
    Bang, H
    Young, S
    BIOSTATISTICS, 2005, 6 (01) : 157 - 169