Double Empirical Bayes Testing

被引:2
|
作者
Tansey, Wesley [1 ]
Wang, Yixin [2 ]
Rabadan, Raul [3 ]
Blei, David [2 ,4 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10021 USA
[2] Columbia Univ, Dept Stat, New York, NY USA
[3] Columbia Univ, Med Ctr, Dept Syst Biol, New York, NY USA
[4] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
cancer drug studies; empirical Bayes; knockoffs; multiple testing; two‐ groups model; FALSE DISCOVERY RATE; DRUG-SENSITIVITY; IDENTIFICATION; INACTIVATION;
D O I
10.1111/insr.12430
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Analysing data from large-scale, multiexperiment studies requires scientists to both analyse each experiment and to assess the results as a whole. In this article, we develop double empirical Bayes testing (DEBT), an empirical Bayes method for analysing multiexperiment studies when many covariates are gathered per experiment. DEBT is a two-stage method: in the first stage, it reports which experiments yielded significant outcomes and in the second stage, it hypothesises which covariates drive the experimental significance. In both of its stages, DEBT builds on the work of Efron, who laid out an elegant empirical Bayes approach to testing. DEBT enhances this framework by learning a series of black box predictive models to boost power and control the false discovery rate. In Stage 1, it uses a deep neural network prior to report which experiments yielded significant outcomes. In Stage 2, it uses an empirical Bayes version of the knockoff filter to select covariates that have significant predictive power of Stage 1 significance. In both simulated and real data, DEBT increases the proportion of discovered significant outcomes and selects more features when signals are weak. In a real study of cancer cell lines, DEBT selects a robust set of biologically plausible genomic drivers of drug sensitivity and resistance in cancer.
引用
收藏
页码:S91 / S113
页数:23
相关论文
共 50 条