Including probe-level uncertainty in model-based gene expression clustering

被引：16

作者：

Liu, Xuejun

Lin, Kevin K.

Andersen, Bogi

Rattray, Magnus

机构：

[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England

[2] Nanjing Univ Aeronaut & Astronaut, Coll Informat Sci & Technol, Nanjing 210016, Peoples R China

[3] Univ Calif Irvine, Inst Genom & Bioinformat, Dept Biol Chem, Irvine, CA 92697 USA

来源：

BMC BIOINFORMATICS | 2007年 / 8卷 / 1期

基金：

英国生物技术与生命科学研究理事会;

关键词：

D O I：

10.1186/1471-2105-8-98

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Clustering is an important analysis performed on microarray gene expression data since it groups genes which have similar expression patterns and enables the exploration of unknown gene functions. Microarray experiments are associated with many sources of experimental and biological variation and the resulting gene expression data are therefore very noisy. Many heuristic and model-based clustering approaches have been developed to cluster this noisy data. However, few of them include consideration of probe- level measurement error which provides rich information about technical variability. Results: We augment a standard model-based clustering method to incorporate probe- level measurement error. Using probe-level measurements from a recently developed Affymetrix probe- level model, multi-mgMOS, we include the probe- level measurement error directly into the standard Gaussian mixture model. Our augmented model is shown to provide improved clustering performance on simulated datasets and a real mouse time-course dataset. Conclusion: The performance of model-based clustering of gene expression data is improved by including probe- level measurement error and more biologically meaningful clustering results are obtained.

引用

页数：19

共 50 条

[41] Parametric model-based clustering
Nikulin, V
Smola, AJ
DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2005, 2005, 5812 : 190 - 201
[42] Niche gene expression programming based on clustering model
Lin, Yishen
Peng, Hong
IITA 2007: WORKSHOP ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, PROCEEDINGS, 2007, : 10 - 13
[43] Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays
Leah Barrera
Chris Benner
Yong-Chuan Tao
Elizabeth Winzeler
Yingyao Zhou
BMC Bioinformatics, 5
[44] Probability of misclassification in model-based clustering
Xuwen Zhu
Computational Statistics, 2019, 34 : 1427 - 1442
[45] Model-based clustering for random hypergraphs
Tin Lok James Ng
Thomas Brendan Murphy
Advances in Data Analysis and Classification, 2022, 16 : 691 - 723
[46] Model-based clustering for populations of networks
Signorelli, Mirko
Wit, Ernst C.
STATISTICAL MODELLING, 2020, 20 (01) : 9 - 29
[47] Model-based clustering of longitudinal data
McNicholas, Paul D.
Murphy, T. Brendan
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
[48] Probe-Level Analysis of Expression Microarrays Characterizes Isoform-Specific Degradation during Mouse Oocyte Maturation
Salisbury, Jesse
Hutchison, Keith W.
Wigglesworth, Karen
Eppig, John J.
Graber, Joel H.
PLOS ONE, 2009, 4 (10):
[49] Boosting for model-based data clustering
Saffari, Amir
Bischof, Horst
PATTERN RECOGNITION, 2008, 5096 : 51 - 60
[50] Dimension reduction for model-based clustering
Luca Scrucca
Statistics and Computing, 2010, 20 : 471 - 484

← 1 2 3 4 5 →