β-empirical Bayes inference and model diagnosis of microarray data

被引:6
|
作者
Mollah, Mohammad Manir Hossain [1 ]
Mollah, M. Nurul Haque [2 ]
Kishino, Hirohisa [1 ]
机构
[1] Univ Tokyo, Grad Sch Agr & Life Sci, Bunkyo Ku, Tokyo 1138657, Japan
[2] Rajshahi Univ, Dept Stat, Rajshahi 6205, Bangladesh
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
DIFFERENTIALLY EXPRESSED GENES; STATISTICAL-METHODS; ENRICHMENT ANALYSIS; ROBUST; DIVERGENCE;
D O I
10.1186/1471-2105-13-135
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Microarray data enables the high-throughput survey of mRNA expression profiles at the genomic level; however, the data presents a challenging statistical problem because of the large number of transcripts with small sample sizes that are obtained. To reduce the dimensionality, various Bayesian or empirical Bayes hierarchical models have been developed. However, because of the complexity of the microarray data, no model can explain the data fully. It is generally difficult to scrutinize the irregular patterns of expression that are not expected by the usual statistical gene by gene models. Results: As an extension of empirical Bayes (EB) procedures, we have developed the beta-empirical Bayes (beta-EB) approach based on a beta-likelihood measure which can be regarded as an 'evidence-based' weighted (quasi-) likelihood inference. The weight of a transcript t is described as a power function of its likelihood, f(beta)(gamma(t)vertical bar theta). Genes with low likelihoods have unexpected expression patterns and low weights. By assigning low weights to outliers, the inference becomes robust. The value of beta, which controls the balance between the robustness and efficiency, is selected by maximizing the predictive beta(0)-likelihood by cross-validation. The proposed beta-EB approach identified six significant (p < 10(-5)) contaminated transcripts as differentially expressed (DE) in normal/tumor tissues from the head and neck of cancer patients. These six genes were all confirmed to be related to cancer; they were not identified as DE genes by the classical EB approach. When applied to the eQTL analysis of Arabidopsis thaliana, the proposed beta-EB approach identified some potential master regulators that were missed by the EB approach. Conclusions: The simulation data and real gene expression data showed that the proposed beta-EB method was robust against outliers. The distribution of the weights was used to scrutinize the irregular patterns of expression and diagnose the model statistically. When beta-weights outside the range of the predicted distribution were observed, a detailed inspection of the data was carried out. The beta-weights described here can be applied to other likelihood-based statistical models for diagnosis, and may serve as a useful tool for transcriptome and proteome studies.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] β-empirical Bayes inference and model diagnosis of microarray data
    Mohammad Manir Hossain Mollah
    M Nurul Haque Mollah
    Hirohisa Kishino
    BMC Bioinformatics, 13
  • [2] Empirical Bayes inference for the Weibull model
    M. Maswadah
    Computational Statistics, 2013, 28 : 2849 - 2859
  • [3] Empirical Bayes inference for the Weibull model
    Maswadah, M.
    COMPUTATIONAL STATISTICS, 2013, 28 (06) : 2849 - 2859
  • [4] Empirical Bayes analysis of unreplicated microarray data
    HyungJun Cho
    Jaewoo Kang
    Jae K. Lee
    Computational Statistics, 2009, 24 : 393 - 408
  • [5] Empirical Bayes analysis of unreplicated microarray data
    Cho, HyungJun
    Kang, Jaewoo
    Lee, Jae K.
    COMPUTATIONAL STATISTICS, 2009, 24 (03) : 393 - 408
  • [6] Prior robust empirical Bayes inference for large-scale data by conditioning on rank with application to microarray data
    Liao, J. G.
    McMurry, Timothy
    Berg, Arthur
    BIOSTATISTICS, 2014, 15 (01) : 60 - 73
  • [7] Empirical Bayes analysis of variance component models for microarray data
    S. Feng
    R. D. Wolfinger
    T. M. Chu
    G. C. Gibson
    L. A. McGraw
    Journal of Agricultural, Biological, and Environmental Statistics, 2006, 11
  • [8] Empirical Bayes analysis of variance component models for microarray data
    Feng, S.
    Wolfinger, R. D.
    Chu, T. M.
    Gibson, G. C.
    McGraw, L. A.
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2006, 11 (02) : 197 - 209
  • [9] Bayes and Empirical Bayes Inference in Changepoint Problems
    Lian, Heng
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (03) : 419 - 430
  • [10] Empirical Bayes and Selective Inference
    Rasines, Daniel Garcia
    Young, G. Alastair
    JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 2022, 102 (04) : 1205 - 1217