Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments

被引:195
|
作者
Sartor, Maureen A.
Tomlinson, Craig R.
Wesselkamper, Scott C.
Sivaganesan, Siva
Leikauf, George D.
Medvedovic, Mario [1 ]
机构
[1] Univ Cincinnati, Dept Environm Hlth, Cincinnati, OH USA
[2] Univ Cincinnati, Ctr Environm Genet, Cincinnati, OH USA
[3] Dartmouth Coll, Hitchcock Med Ctr, Dept Med Pharmacol & Toxicol, Lebanon, NH 03756 USA
[4] Univ Cincinnati, Dept Math Sci, Cincinnati, OH 45221 USA
[5] Cincinnati Childrens Hosp Med Ctr, Biomed Informat Div, Cincinnati, OH USA
关键词
D O I
10.1186/1471-2105-7-538
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The small sample sizes often used for microarray experiments result in poor estimates of variance if each gene is considered independently. Yet accurately estimating variability of gene expression measurements in microarray experiments is essential for correctly identifying differentially expressed genes. Several recently developed methods for testing differential expression of genes utilize hierarchical Bayesian models to "pool" information from multiple genes. We have developed a statistical testing procedure that further improves upon current methods by incorporating the well-documented relationship between the absolute gene expression level and the variance of gene expression measurements into the general empirical Bayes framework. Results: We present a novel Bayesian moderated-T, which we show to perform favorably in simulations, with two real, dual-channel microarray experiments and in two controlled single-channel experiments. In simulations, the new method achieved greater power while correctly estimating the true proportion of false positives, and in the analysis of two publicly-available "spike-in" experiments, the new method performed favorably compared to all tested alternatives. We also applied our method to two experimental datasets and discuss the additional biological insights as revealed by our method in contrast to the others. The R-source code for implementing our algorithm is freely available at http://eh3.uc.edu/ibmt. Conclusion: We use a Bayesian hierarchical normal model to define a novel Intensity-Based Moderated T-statistic (IBMT). The method is completely data-dependent using empirical Bayes philosophy to estimate hyperparameters, and thus does not require specification of any free parameters. IBMT has the strength of balancing two important factors in the analysis of microarray data: the degree of independence of variances relative to the degree of identity (i.e. t-tests vs. equal variance assumption), and the relationship between variance and signal intensity. When this variance-intensity relationship is weak or does not exist, IBMT reduces to a previously described moderated t-statistic. Furthermore, our method may be directly applied to any array platform and experimental design. Together, these properties show IBMT to be a valuable option in the analysis of virtually any microarray experiment.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments
    Maureen A Sartor
    Craig R Tomlinson
    Scott C Wesselkamper
    Siva Sivaganesan
    George D Leikauf
    Mario Medvedovic
    [J]. BMC Bioinformatics, 7
  • [2] An efficient method to identify differentially expressed genes in microarray experiments
    Qin, Huaizhen
    Feng, Tao
    Harding, Scott A.
    Tsai, Chung-Jui
    Zhang, Shuanglin
    [J]. BIOINFORMATICS, 2008, 24 (14) : 1583 - 1589
  • [3] An empirical bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments
    Datta, S
    Satten, GA
    Xia, JZ
    Heslin, MJ
    Datta, S
    [J]. BIOINFORMATICS, 2004, 20 (02) : 235 - 242
  • [4] Identifying differentially expressed genes in cDNA microarray experiments
    Baggerly, KA
    Coombes, KR
    Hess, KR
    Stivers, DN
    Abruzzo, LV
    Zhang, W
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 639 - 659
  • [5] Selecting differentially expressed genes from microarray experiments
    Pepe, MS
    Longton, G
    Anderson, GL
    Schummer, M
    [J]. BIOMETRICS, 2003, 59 (01) : 133 - 142
  • [6] Sample size for detecting differentially expressed genes in microarray experiments
    Caimiao Wei
    Jiangning Li
    Roger E Bumgarner
    [J]. BMC Genomics, 5
  • [7] Improving the identification of differentially expressed genes in cDNA microarray experiments
    Ultsch, A
    [J]. Classification - the Ubiquitous Challenge, 2005, : 378 - 385
  • [8] Sample size for identifying differentially expressed genes in microarray experiments
    Wang, SJ
    Chen, JJ
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (04) : 714 - 726
  • [9] Sample size for detecting differentially expressed genes in microarray experiments
    Wei, CM
    Li, JN
    Bumgarner, RE
    [J]. BMC GENOMICS, 2004, 5 (1)
  • [10] Combining Not-Proper ROC Curves and Hierarchical Clustering to Detect Differentially Expressed Genes in Microarray Experiments
    Parodi, Stefano
    Pistoia, Vito
    Muselli, Marco
    [J]. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS: 10TH INTERNATIONAL MEETING, 2014, 8452 : 238 - 247