Including probe-level uncertainty in model-based gene expression clustering

被引:16
|
作者
Liu, Xuejun
Lin, Kevin K.
Andersen, Bogi
Rattray, Magnus
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[2] Nanjing Univ Aeronaut & Astronaut, Coll Informat Sci & Technol, Nanjing 210016, Peoples R China
[3] Univ Calif Irvine, Inst Genom & Bioinformat, Dept Biol Chem, Irvine, CA 92697 USA
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1186/1471-2105-8-98
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Clustering is an important analysis performed on microarray gene expression data since it groups genes which have similar expression patterns and enables the exploration of unknown gene functions. Microarray experiments are associated with many sources of experimental and biological variation and the resulting gene expression data are therefore very noisy. Many heuristic and model-based clustering approaches have been developed to cluster this noisy data. However, few of them include consideration of probe- level measurement error which provides rich information about technical variability. Results: We augment a standard model-based clustering method to incorporate probe- level measurement error. Using probe-level measurements from a recently developed Affymetrix probe- level model, multi-mgMOS, we include the probe- level measurement error directly into the standard Gaussian mixture model. Our augmented model is shown to provide improved clustering performance on simulated datasets and a real mouse time-course dataset. Conclusion: The performance of model-based clustering of gene expression data is improved by including probe- level measurement error and more biologically meaningful clustering results are obtained.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Model-based clustering with genes expression dynamics for time-course gene expression data
    Wu, FX
    Zhang, WJ
    Kusalik, AJ
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 267 - 274
  • [22] Model-based clustering of Baltic sea-level
    Scotto, M. G.
    Barbosa, Susana M.
    Alonso, Andres M.
    APPLIED OCEAN RESEARCH, 2009, 31 (01) : 4 - 11
  • [23] Model-Based Clustering
    Paul D. McNicholas
    Journal of Classification, 2016, 33 : 331 - 373
  • [24] Model-Based Clustering
    Gormley, Isobel Claire
    Murphy, Thomas Brendan
    Raftery, Adrian E.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 573 - 595
  • [25] Model-Based Clustering
    McNicholas, Paul D.
    JOURNAL OF CLASSIFICATION, 2016, 33 (03) : 331 - 373
  • [26] A tractable probabilistic model for Affymetrix probe-level analysis across multiple chips
    Liu, XJ
    Milo, M
    Lawrence, ND
    Rattray, M
    BIOINFORMATICS, 2005, 21 (18) : 3637 - 3644
  • [27] Model-Based Clustering of Longitudinal Data: Application to Modeling Disease Course and Gene Expression Trajectories
    Ciampi, A.
    Campbell, H.
    Dyachenko, A.
    Rich, B.
    McCusker, J.
    Cole, M. G.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (07) : 992 - 1005
  • [28] Bayesian model-based clustering of temporal gene expression using autoregressive panel data approach
    Nascimento, Moyses
    Safadi, Thelma
    Fonseca e Silva, Fabyano
    Nascimento, Ana Carolina C.
    BIOINFORMATICS, 2012, 28 (15) : 2004 - 2007
  • [29] INTEGRATIVE MODEL-BASED CLUSTERING OF MICROARRAY METHYLATION AND EXPRESSION DATA
    Kormaksson, Matthias
    Booth, James G.
    Figueroa, Maria E.
    Melnick, Ari
    ANNALS OF APPLIED STATISTICS, 2012, 6 (03): : 1327 - 1347
  • [30] A mixture model-based approach to the clustering of microarray expression data
    McLachlan, GJ
    Bean, RW
    Peel, D
    BIOINFORMATICS, 2002, 18 (03) : 413 - 422