Mixture modeling of microarray gene expression data

被引:0
|
作者
Yang Yang
Adam P Tashman
Jung Yeon Lee
Seungtai Yoon
Wenyang Mao
Kwangmi Ahn
Wonkuk Kim
Nancy R Mendell
Derek Gordon
Stephen J Finch
机构
[1] Stony Brook University,Department of Applied Mathematics and Statistics
[2] Cold Spring Harbor Laboratory,Department of Health Evaluation Sciences
[3] Cold Spring Harbor,Department of Genetics
[4] A210,undefined
[5] Penn State College of Medicine,undefined
[6] Rutgers University,undefined
关键词
Concordance Rate; Mixture Distribution; Bayesian Posterior Probability; Mixture Analysis; Gene Expression Variable;
D O I
10.1186/1753-6561-1-S1-S50
中图分类号
学科分类号
摘要
About 28% of genes appear to have an expression pattern that follows a mixture distribution. We use first- and second-order partial correlation coefficients to identify trios and quartets of non-sex-linked genes that are highly associated and that are also mixtures. We identified 18 trio and 35 quartet mixtures and evaluated their mixture distribution concordance. Concordance was defined as the proportion of observations that simultaneously fall in the component with the higher mean or simultaneously in the component with the lower mean based on their Bayesian posterior probabilities. These trios and quartets have a concordance rate greater than 80%. There are 33 genes involved in these trios and quartets. A factor analysis with varimax rotation identifies three gene groups based on their factor loadings. One group of 18 genes has a concordance rate of 56.7%, another group of 8 genes has a concordance rate of 60.8%, and a third group of 7 genes has a concordance rate of 69.6%. Each of these rates is highly significant, suggesting that there may be strong biological underpinnings for the mixture mechanisms of these genes. Bayesian factor screening confirms this hypothesis by identifying six single-nucleotide polymorphisms that are significantly associated with the expression phenotypes of the five most concordant genes in the first group.
引用
收藏
相关论文
共 50 条
  • [31] Cluster analysis using multivariate normal mixture models to detect differential gene expression with microarray data
    He, Yi
    Pan, Wei
    Lin, Jizhen
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (02) : 641 - 658
  • [32] Gene Screening and Clustering of Yeast Microarray Gene Expression Data
    Lee, Kyunga
    Kim, Taehoun
    Kim, Jaehee
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (06) : 1077 - 1094
  • [33] Factor Analysis with Mixture Modeling to Evaluate Coherent Patterns in Microarray Data
    Nunes Duarte, Joao Daniel
    Mayrink, Vinicius Diniz
    INTERDISCIPLINARY BAYESIAN STATISTICS, EBEB 2014, 2015, 118 : 185 - 195
  • [34] A mixture model-based approach to the clustering of microarray expression data
    McLachlan, GJ
    Bean, RW
    Peel, D
    BIOINFORMATICS, 2002, 18 (03) : 413 - 422
  • [35] GEPRO: Gene Expression Profiler for DNA microarray data
    Kim, Beob G.
    Lindemann, Merlin D.
    Bridges, Phillip J.
    Ko, CheMyong
    REVISTA COLOMBIANA DE CIENCIAS PECUARIAS, 2009, 22 (01) : 12 - 18
  • [36] Bayesian models for gene expression with DNA microarray data
    Ibrahim, JG
    Chen, MH
    Gray, RJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 88 - 99
  • [37] Statistical Quality Control of Microarray Gene Expression Data
    Lu, Shen
    Segall, Richard S.
    WMSCI 2011: 15TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, 2011, : 206 - 211
  • [38] MIDGET:Detecting differential gene expression on microarray data
    Angelescu, Radu
    Dobrescu, Radu
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 211
  • [39] Quick hierarchical biclustering on microarray gene expression data
    Ji, Liping
    Mock, Kenneth Wei-Liang
    Tan, Kian-Lee
    BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 110 - +
  • [40] A genetic approach for gene selection on microarray expression data
    Kim, YH
    Lee, SY
    Moon, BR
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2004, PT 1, PROCEEDINGS, 2004, 3102 : 346 - 355