Resampling-based multiple testing for microarray data analysis

被引:231
|
作者
Ge, YC
Dudoit, S
Speed, TP
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Parkville, Vic, Australia
关键词
multiple testing; family-wise error rate; false discovery rate; adjusted p-value; fast algorithm; minP; microarray;
D O I
10.1007/BF02595811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The burgeoning field of genomics has revived interest in multiple testing procedures by raising new methodological and computational challenges. For example, microarray experiments generate large multiplicity problems in which thousands of hypotheses are tested simultaneously. Westfall and Young (1993) propose resampling-based p-value adjustment procedures which are highly relevant to microarray experiments. This article discusses different criteria for error control in resampling-based multiple testing, including (a) the family wise error rate of Westfall and Young (1993) and (b) the false discovery rate developed by Benjamini and Hochberg (1995), both from a frequentist viewpoint; and (c) the positive false discovery rate of Storey (2002a), which has a Bayesian motivation. We also introduce our recently developed fast algorithm for implementing the minP adjustment to control family-wise error rate. Adjusted p-values for different approaches are applied to gene expression data from two recently published microarray studies. The properties of these procedures for multiple testing are compared.
引用
收藏
页码:1 / 77
页数:77
相关论文
共 50 条
  • [1] Resampling-based multiple testing for microarray data analysis
    Youngchao Ge
    Sandrine Dudoit
    Terence P. Speed
    Test, 2003, 12 : 1 - 77
  • [2] Choice of a null distribution in resampling-based multiple testing
    Pollard, KS
    van der Laan, MJ
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 125 (1-2) : 85 - 100
  • [3] Resampling-based stepwise multiple testing procedures with applications to clinical trial data
    He, Jiwei
    Li, Feng
    Gao, Yan
    Rothmann, Mark
    PHARMACEUTICAL STATISTICS, 2021, 20 (02) : 297 - 313
  • [4] Resampling-based multiple comparison procedure with application to point-wise testing with functional data
    Vsevolozhskaya, Olga A.
    Greenwood, Mark C.
    Powell, Scott L.
    Zaykin, Dmitri V.
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2015, 22 (01) : 45 - 59
  • [5] Resampling-based multiple comparison procedure with application to point-wise testing with functional data
    Olga A. Vsevolozhskaya
    Mark C. Greenwood
    Scott L. Powell
    Dmitri V. Zaykin
    Environmental and Ecological Statistics, 2015, 22 : 45 - 59
  • [6] Statistical properties of an early stopping rule for resampling-based multiple testing
    Jiang, Hui
    Salzman, Julia
    BIOMETRIKA, 2012, 99 (04) : 973 - 980
  • [7] Resampling-based methods for the analysis of multiple endpoints in clinical trials
    Reitmeir, P
    Wassmer, G
    STATISTICS IN MEDICINE, 1999, 18 (24) : 3453 - 3462
  • [8] SCOPE OF RESAMPLING-BASED TESTS IN fNIRS NEUROIMAGING DATA ANALYSIS
    Singh, Archana K.
    Clowney, Lester
    Okamoto, Masakc
    Cole, James B.
    Dan, Ippeita
    STATISTICA SINICA, 2008, 18 (04) : 1519 - 1534
  • [9] Resampling-based Methods in Single and Multiple Testing for Equality of Covariance/Correlation Matrices
    Yang, Yang
    DeGruttola, Victor
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2012, 8 (01):
  • [10] Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data
    Stefano Monti
    Pablo Tamayo
    Jill Mesirov
    Todd Golub
    Machine Learning, 2003, 52 : 91 - 118