Resampling-based multiple testing for microarray data analysis

被引:231
|
作者
Ge, YC
Dudoit, S
Speed, TP
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[3] Walter & Eliza Hall Inst Med Res, Div Genet & Bioinformat, Parkville, Vic, Australia
关键词
multiple testing; family-wise error rate; false discovery rate; adjusted p-value; fast algorithm; minP; microarray;
D O I
10.1007/BF02595811
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The burgeoning field of genomics has revived interest in multiple testing procedures by raising new methodological and computational challenges. For example, microarray experiments generate large multiplicity problems in which thousands of hypotheses are tested simultaneously. Westfall and Young (1993) propose resampling-based p-value adjustment procedures which are highly relevant to microarray experiments. This article discusses different criteria for error control in resampling-based multiple testing, including (a) the family wise error rate of Westfall and Young (1993) and (b) the false discovery rate developed by Benjamini and Hochberg (1995), both from a frequentist viewpoint; and (c) the positive false discovery rate of Storey (2002a), which has a Bayesian motivation. We also introduce our recently developed fast algorithm for implementing the minP adjustment to control family-wise error rate. Adjusted p-values for different approaches are applied to gene expression data from two recently published microarray studies. The properties of these procedures for multiple testing are compared.
引用
收藏
页码:1 / 77
页数:77
相关论文
共 50 条
  • [21] Appropriateness of some resampling-based inference procedures for assessing performance of prognostic classifiers derived from microarray data
    Lusa, Lara
    McShane, Lisa M.
    Radmacher, Michael D.
    Shih, Joanna H.
    Wright, George W.
    Simon, Richard
    STATISTICS IN MEDICINE, 2007, 26 (05) : 1102 - 1113
  • [23] Resampling-Based Similarity Measures for High-Dimensional Data
    Amaratunga, Dhammika
    Cabrera, Javier
    Lee, Yung-Seop
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (01) : 54 - 62
  • [24] A Bayesian approach to efficient differential allocation for resampling-based significance testing
    Jensen, Shane T.
    Soi, Sameer
    Wang, Li-San
    BMC BIOINFORMATICS, 2009, 10
  • [25] A Bayesian approach to efficient differential allocation for resampling-based significance testing
    Shane T Jensen
    Sameer Soi
    Li-San Wang
    BMC Bioinformatics, 10
  • [26] Multiple testing in the survival analysis of microarray data
    Correa, JA
    Dudoit, S
    Goldstein, DR
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2002, 10 : 298 - 298
  • [27] Resampling-based confidence regions and multiple tests for a correlated random vector
    Arlot, Sylvain
    Blanchard, Gilles
    Roquain, Etienne
    LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 127 - +
  • [28] Resampling-based noise correction for crowdsourcing
    Xu, Wenqiang
    Jiang, Liangxiao
    Li, Chaoqun
    Journal of Experimental and Theoretical Artificial Intelligence, 2021, 33 (06): : 985 - 999
  • [29] Resampling-Based Change Point Estimation
    Fiosina, Jelena
    Fiosins, Maksims
    ADVANCES IN INTELLIGENT DATA ANALYSIS X: IDA 2011, 2011, 7014 : 150 - 161
  • [30] Resampling-based selective clustering ensembles
    Hong, Yi
    Kwong, Sam
    Wang, Hanli
    Ren, Qjngsheng
    PATTERN RECOGNITION LETTERS, 2009, 30 (03) : 298 - 305