Including probe-level uncertainty in model-based gene expression clustering

被引:16
|
作者
Liu, Xuejun
Lin, Kevin K.
Andersen, Bogi
Rattray, Magnus
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[2] Nanjing Univ Aeronaut & Astronaut, Coll Informat Sci & Technol, Nanjing 210016, Peoples R China
[3] Univ Calif Irvine, Inst Genom & Bioinformat, Dept Biol Chem, Irvine, CA 92697 USA
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1186/1471-2105-8-98
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Clustering is an important analysis performed on microarray gene expression data since it groups genes which have similar expression patterns and enables the exploration of unknown gene functions. Microarray experiments are associated with many sources of experimental and biological variation and the resulting gene expression data are therefore very noisy. Many heuristic and model-based clustering approaches have been developed to cluster this noisy data. However, few of them include consideration of probe- level measurement error which provides rich information about technical variability. Results: We augment a standard model-based clustering method to incorporate probe- level measurement error. Using probe-level measurements from a recently developed Affymetrix probe- level model, multi-mgMOS, we include the probe- level measurement error directly into the standard Gaussian mixture model. Our augmented model is shown to provide improved clustering performance on simulated datasets and a real mouse time-course dataset. Conclusion: The performance of model-based clustering of gene expression data is improved by including probe- level measurement error and more biologically meaningful clustering results are obtained.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Parametric model-based clustering
    Nikulin, V
    Smola, AJ
    DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2005, 2005, 5812 : 190 - 201
  • [42] Niche gene expression programming based on clustering model
    Lin, Yishen
    Peng, Hong
    IITA 2007: WORKSHOP ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, PROCEEDINGS, 2007, : 10 - 13
  • [43] Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays
    Leah Barrera
    Chris Benner
    Yong-Chuan Tao
    Elizabeth Winzeler
    Yingyao Zhou
    BMC Bioinformatics, 5
  • [44] Probability of misclassification in model-based clustering
    Xuwen Zhu
    Computational Statistics, 2019, 34 : 1427 - 1442
  • [45] Model-based clustering for random hypergraphs
    Tin Lok James Ng
    Thomas Brendan Murphy
    Advances in Data Analysis and Classification, 2022, 16 : 691 - 723
  • [46] Model-based clustering for populations of networks
    Signorelli, Mirko
    Wit, Ernst C.
    STATISTICAL MODELLING, 2020, 20 (01) : 9 - 29
  • [47] Model-based clustering of longitudinal data
    McNicholas, Paul D.
    Murphy, T. Brendan
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
  • [48] Probe-Level Analysis of Expression Microarrays Characterizes Isoform-Specific Degradation during Mouse Oocyte Maturation
    Salisbury, Jesse
    Hutchison, Keith W.
    Wigglesworth, Karen
    Eppig, John J.
    Graber, Joel H.
    PLOS ONE, 2009, 4 (10):
  • [49] Boosting for model-based data clustering
    Saffari, Amir
    Bischof, Horst
    PATTERN RECOGNITION, 2008, 5096 : 51 - 60
  • [50] Dimension reduction for model-based clustering
    Luca Scrucca
    Statistics and Computing, 2010, 20 : 471 - 484