Empirical Bayes analysis of variance component models for microarray data

被引:0
|
作者
S. Feng
R. D. Wolfinger
T. M. Chu
G. C. Gibson
L. A. McGraw
机构
[1] Duke University,Department of Biostatistics and Bioinformatics
[2] SAS Institute,Department of Genetics
[3] Inc.,Department of Genetics and Development
[4] North Carolina State University,undefined
[5] Cornell University,undefined
关键词
Microarray data analysis; ROC curves; Shrinkage estimators;
D O I
暂无
中图分类号
学科分类号
摘要
A gene-by-gene mixed model analysis is a useful statistical method for assessing significance for microarray gene differential expression. While a large amount of data on thousands of genes are collected in a microarray experiment, the sample size for each gene is usually small, which could limit the statistical power of this analysis. In this report, we introduce an empirical Bayes (EB) approach for general variance component models applied to microarray data. Within a linear mixed model framework, the restricted maximum likelihood (REML) estimates of variance components of each gene are adjusted by integrating information on variance components estimated from all genes. The approach starts with a series of single-gene analyses. The estimated variance components from each gene are transformed to the “ANOVA components”. This transformation makes it possible to independently estimate the marginal distribution of each “ANOVA component.” The modes of the posterior distributions are estimated and inversely transformed to compute the posterior estimates of the variance components. The EB statistic is constructed by replacing the REML variance estimates with the EB variance estimates in the usual t statistic. The EB approach is illustrated with a real data example which compares the effects of five different genotypes of male flies on post-mating gene expression in female flies. In a simulation study, the ROC curves are applied to compare the EB statistic and two other statistics. The EB statistic was found to be the most powerful of the three. Though the null distribution of the EB statistic is unknown, a t distribution may be used to provide conservative control of the false positive rate.
引用
收藏
相关论文
共 50 条
  • [1] Empirical Bayes analysis of variance component models for microarray data
    Feng, S.
    Wolfinger, R. D.
    Chu, T. M.
    Gibson, G. C.
    McGraw, L. A.
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2006, 11 (02) : 197 - 209
  • [2] Empirical Bayes analysis of unreplicated microarray data
    HyungJun Cho
    Jaewoo Kang
    Jae K. Lee
    Computational Statistics, 2009, 24 : 393 - 408
  • [3] Empirical Bayes analysis of unreplicated microarray data
    Cho, HyungJun
    Kang, Jaewoo
    Lee, Jae K.
    COMPUTATIONAL STATISTICS, 2009, 24 (03) : 393 - 408
  • [4] An empirical Bayes method for robust variance estimation in detecting DEGs using microarray data
    You, Na
    Wang, Xueqin
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2017, 15 (05)
  • [5] Bayes factors and approximations for variance component models
    Pauler, DK
    Wakefield, JC
    Kass, RE
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (448) : 1242 - 1253
  • [6] Empirical Bayes analysis of a microarray experiment
    Efron, B
    Tibshirani, R
    Storey, JD
    Tusher, V
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1151 - 1160
  • [7] Variance Component Estimation for Mixed Model Analysis of cDNA Microarray Data
    Sarholz, Barbara
    Piepho, Hans-Peter
    BIOMETRICAL JOURNAL, 2008, 50 (06) : 927 - 939
  • [8] β-empirical Bayes inference and model diagnosis of microarray data
    Mohammad Manir Hossain Mollah
    M Nurul Haque Mollah
    Hirohisa Kishino
    BMC Bioinformatics, 13
  • [9] β-empirical Bayes inference and model diagnosis of microarray data
    Mollah, Mohammad Manir Hossain
    Mollah, M. Nurul Haque
    Kishino, Hirohisa
    BMC BIOINFORMATICS, 2012, 13
  • [10] Analysis of variance of microarray data
    Ayroles, Julien F.
    Gibson, Greg
    DNA MICROARRAYS, PART B: DATABASES AND STATISTICS, 2006, 411 : 214 - +