Query Large Scale Microarray Compendium Datasets Using a Model-Based Bayesian Approach with Variable Selection

被引:4
|
作者
Hu, Ming [1 ]
Qin, Zhaohui S. [1 ]
机构
[1] Univ Michigan, Sch Publ Hlth, Dept Biostat, Ctr Stat Genet, Ann Arbor, MI 48109 USA
来源
PLOS ONE | 2009年 / 4卷 / 02期
关键词
GENE-EXPRESSION DATA; GENOME; IDENTIFICATION; ALGORITHM; NETWORKS; PATTERNS; FAMILY; GUILT;
D O I
10.1371/journal.pone.0004495
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In microarray gene expression data analysis, it is often of interest to identify genes that share similar expression profiles with a particular gene such as a key regulatory protein. Multiple studies have been conducted using various correlation measures to identify co-expressed genes. While working well for small datasets, the heterogeneity introduced from increased sample size inevitably reduces the sensitivity and specificity of these approaches. This is because most co-expression relationships do not extend to all experimental conditions. With the rapid increase in the size of microarray datasets, identifying functionally related genes from large and diverse microarray gene expression datasets is a key challenge. We develop a model-based gene expression query algorithm built under the Bayesian model selection framework. It is capable of detecting co-expression profiles under a subset of samples/experimental conditions. In addition, it allows linearly transformed expression patterns to be recognized and is robust against sporadic outliers in the data. Both features are critically important for increasing the power of identifying co-expressed genes in large scale gene expression datasets. Our simulation studies suggest that this method outperforms existing correlation coefficients or mutual information-based query tools. When we apply this new method to the Escherichia coli microarray compendium data, it identifies a majority of known regulons as well as novel potential target genes of numerous key transcription factors.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A model-based relevance estimation approach for feature selection in microarray datasets
    Bontempi, Gianluca
    Meyer, Patrick E.
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT II, 2008, 5164 : 21 - 31
  • [2] Bayesian Variable Selection in Linear Regression in One Pass for Large Datasets
    Ordonez, Carlos
    Garcia-Alvarado, Carlos
    Baladandayuthapani, Veerabhadaran
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 9 (01)
  • [3] A simple model-based approach to variable selection in classification and clustering
    Partovi Nia, Vahid
    Davison, Anthony C.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (02): : 157 - 175
  • [4] Hierarchical model-based clustering for large datasets
    Posse, C
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2001, 10 (03) : 464 - 486
  • [5] CCFS: A cooperating coevolution technique for large scale feature selection on microarray datasets
    Ebrahimpour, Mohammad K.
    Nezamabadi-Pour, Hossein
    Eftekhari, Mandi
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 73 : 171 - 178
  • [6] Variable selection for model-based clustering
    Raftery, AE
    Dean, N
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 168 - 178
  • [7] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Celeux, Gilles
    Maugis-Rabusseau, Cathy
    Sedki, Mohammed
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (01) : 259 - 278
  • [8] Variable selection in model-based clustering and discriminant analysis with a regularization approach
    Gilles Celeux
    Cathy Maugis-Rabusseau
    Mohammed Sedki
    [J]. Advances in Data Analysis and Classification, 2019, 13 : 259 - 278
  • [9] Variable selection for model-based high-dimensional clustering and its application to microarray data
    Wang, Sijian
    Zhu, Ji
    [J]. BIOMETRICS, 2008, 64 (02) : 440 - 448
  • [10] A Trust Model-Based Bayesian Decision Theory in Large Scale Internet of Things
    Kurniawan, Agus
    Kyas, Marcel
    [J]. 2015 IEEE TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING (ISSNIP), 2015,