Resampling-Based Empirical Bayes Multiple Testing Procedures for Controlling Generalized Tail Probability and Expected Value Error Rates: Focus on the False Discovery Rate and Simulation Study

Cited by: 21

Authors:

Dudoit, Sandrine [1,2]

Gilbert, Houston N. [1]

van der Laan, Mark J. [1,2]

Affiliations:

[1] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA

[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA

Source:

BIOMETRICAL JOURNAL | 2008 / Vol. 50 / No. 05

Keywords:

Adaptive; Adjusted p-value; Alternative hypothesis; Bootstrap; Correlation; Cut-off; Empirical Bayes; False discovery rate; Generalized expected value error rate; Generalized tail probability error rate; Joint distribution; Linear step-up procedure; Marginal procedure; Mixture model; Multiple hypothesis testing; Non-parametric; Null distributions; Null hypothesis; Posterior probability; Power; Prior probability; Proportion of true null hypotheses; q-value; R package; Receiver operator characteristic curve; Rejection region; Resampling; Simulation study; Software; t-statistic; Test statistic; Type I error rate;

DOI:

10.1002/bimj.200710473

Chinese Library Classification number:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

This article proposes resampling-based empirical Bayes multiple testing procedures for controlling a broad class of Type I error rates, defined as generalized tail probability (gTP) error rates, gTP(q, g) = Pr(g(V_n, S_n) > q), and generalized expected value (gEV) error rates, gEV(g) = E[g(V_n, S_n)], for arbitrary functions g(V_n, S_n) of the numbers of false positives V_n and true positives S_n. Of particular interest are error rates based on the proportion g(V_n, S_n) = V_n/(V_n + S_n) of Type I errors among the rejected hypotheses, such as the false discovery rate (FDR), FDR = E[V_n/(V_n + S_n)]. The proposed procedures offer several advantages over existing methods. They provide Type I error control for general data generating distributions, with arbitrary dependence structures among variables. Gains in power are achieved by deriving rejection regions based on guessed sets of true null hypotheses and null test statistics randomly sampled from joint distributions that account for the dependence structure of the data. The Type I error and power properties of an FDR-controlling version of the resampling-based empirical Bayes approach are investigated and compared to those of widely-used FDR-controlling linear step-up procedures in a simulation study. The Type I error and power trade-off achieved by the empirical Bayes procedures under a variety of testing scenarios allows this approach to be competitive with or outperform the Storey and Tibshirani (2003) linear step-up procedure, as an alternative to the classical Benjamini and Hochberg (1995) procedure.
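
The error-rate definitions in the abstract can be illustrated with a minimal Python sketch. It is not the article's resampling-based empirical Bayes procedure: it only estimates gTP(q, g) and gEV(g) by averaging over simulated pairs (V_n, S_n), and adds the classical Benjamini and Hochberg (1995) linear step-up rule as the baseline the abstract mentions. All function names are hypothetical, and the convention that the proportion equals 0 when no hypothesis is rejected is an assumption not stated in the abstract.

```python
import numpy as np


def false_discovery_proportion(v, s):
    """g(V_n, S_n) = V_n / (V_n + S_n); assumed to be 0 when V_n + S_n = 0."""
    v = np.asarray(v, dtype=float)
    s = np.asarray(s, dtype=float)
    r = v + s  # total number of rejected hypotheses
    return np.divide(v, r, out=np.zeros_like(r), where=r > 0)


def gtp(v, s, g, q):
    """Monte Carlo estimate of gTP(q, g) = Pr(g(V_n, S_n) > q)."""
    return float(np.mean(g(v, s) > q))


def gev(v, s, g):
    """Monte Carlo estimate of gEV(g) = E[g(V_n, S_n)]; FDR when g is the proportion."""
    return float(np.mean(g(v, s)))


def bh_step_up(pvalues, alpha):
    """Benjamini-Hochberg linear step-up rule: boolean mask of rejected hypotheses."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Compare the k-th smallest p-value with alpha * k / m, for k = 1..m.
    below = p[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.max(np.nonzero(below)[0])  # largest index meeting its threshold
        reject[order[: k + 1]] = True     # reject it and every smaller p-value
    return reject


# Four simulated data sets: false positives V_n and true positives S_n.
v = [0, 1, 2, 0]
s = [0, 3, 2, 5]
fdr_hat = gev(v, s, false_discovery_proportion)
tail_hat = gtp(v, s, false_discovery_proportion, q=0.1)
rejected = bh_step_up([0.001, 0.008, 0.039, 0.041, 0.042], alpha=0.05)
```

In this toy input the last p-value meets its threshold even though the third and fourth do not, so the step-up rule rejects all five hypotheses; that is the behavior that distinguishes a step-up rule from a step-down one.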


Pages: 716-744

Number of pages: 29
