共 2 条
Resampling-Based Empirical Bayes Multiple Testing Procedures for Controlling Generalized Tail Probability and Expected Value Error Rates: Focus on the False Discovery Rate and Simulation Study
被引:21
|作者:
Dudoit, Sandrine
[1
,2
]
Gilbert, Houston N.
[1
]
van der Laan, Mark J.
[1
,2
]
机构:
[1] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词:
Adaptive;
Adjusted p-value;
Alternative hypothesis;
Bootstrap;
Correlation;
Cut-off;
Empirical Bayes;
False discovery rate;
Generalized expected value error rate;
Generalized tail probability error rate;
Joint distribution;
Linear step-up procedure;
Marginal procedure;
Mixture model;
Multiple hypothesis testing;
Non-parametric;
Null distributions;
Null hypothesis;
Posterior probability;
Power;
Prior probability;
Proportion of true null hypotheses;
q-value;
R package;
Receiver operator characteristic curve;
Rejection region;
Resampling;
Simulation study;
Software;
t-statistic;
Test statistic;
Type I error rate;
D O I:
10.1002/bimj.200710473
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
This article proposes resampling-based empirical Bayes multiple testing procedures for controlling a broad class of Type I error rates, defined as generalized tail probability (gTP) error rates, gTP(q, g) = Pr (g(V-n, S-n) > q), and generalized expected value (gEV) error rates, gEV(g) = E[g(V-n, S-n)], for arbitrary functions g(V-n, S-n) of the numbers of false positives V-n and true positives S-n. Of particular interest are error rates based on the proportion g(V-n, S-n) = V-n/(V-n + S-n) of Type I errors among the rejected hypotheses, such as the false discovery rate (FDR), FDR = E[V-n/(V-n + S-n)]. The proposed procedures offer several advantages over existing methods. They provide Type I error control for general data generating distributions, with arbitrary dependence structures among variables. Gains in power are achieved by deriving rejection regions based on guessed sets of true null hypotheses and null test statistics randomly sampled from joint distributions that account for the dependence structure of the data. The Type I error and power properties of an FDR-controlling version of the resampling-based empirical Bayes approach are investigated and compared to those of widely-used FDR-controlling linear step-up procedures in a simulation study. The Type I error and power trade-off achieved by the empirical Bayes procedures under a variety of testing scenarios allows this approach to be competitive with or outperform the Storey and Tibshirani (2003) linear step-up procedure, as an alternative to the classical Benjamini and Hochberg (1995) procedure.
引用
收藏
页码:716 / 744
页数:29
相关论文