Problems in gene clustering based on gene expression data

被引:36
|
作者
Bryan, J
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6M 1L2, Canada
[2] Univ British Columbia, Biotechnol Lab, Vancouver, BC V6M 1L2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
cluster analysis; microarrays; confidence; bootstrap;
D O I
10.1016/j.jmva.2004.02.011
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this work, we assess the suitability of cluster analysis for the gene grouping problem confronted with microarray data. Gene clustering is the exercise of grouping genes based on attributes, which are generally the expression levels over a number of conditions or subpopulations. The hope is that similarity with respect to expression is often indicative of similarity with respect to much more fundamental and elusive qualities, such as function. By formally defining the true gene-specific attributes as parameters, such as expected expression across the conditions, we obtain a well-defined gene clustering parameter of interest, which greatly facilitates the statistical treatment of gene clustering. We point out that genome-wide collections of expression trajectories often lack natural clustering structure, prior to ad hoc gene filtering. The gene filters in common use induce a certain circularity to most gene cluster analyses: genes are points in the attribute space, a filter is applied to depopulate certain areas of the space, and then clusters are sought (and often found!) in the "cleaned" attribute space. As a result, statistical investigations of cluster number and clustering strength are just as much a study of the stringency and nature of the filter as they are of any biological gene clusters. In the absence of natural clusters, gene clustering may still be a worthwhile exercise in data segmentation. In this context, partitions can be fruitfully encoded in adjacency matrices and the sampling distribution of such matrices can be studied with a variety of bootstrapping techniques. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:44 / 66
页数:23
相关论文
共 50 条
  • [1] Gene-Ontology-based clustering of gene expression data
    Adryan, B
    Schuh, R
    [J]. BIOINFORMATICS, 2004, 20 (16) : 2851 - 2852
  • [2] Projection Based Clustering of Gene Expression Data
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    Tasoulis, Dimitris K.
    [J]. COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS, 2010, 6160 : 228 - +
  • [3] Fuzzy Rule Based Clustering for Gene Expression Data
    Sinaee, Mehrnoosh
    Mansoori, Eghbal G.
    [J]. FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION (ISMS 2013), 2013, : 7 - 11
  • [4] Clustering of Gene Expression Data Based on Shape Similarity
    Hestilow, Travis J.
    Huang, Yufei
    [J]. EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [5] DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA
    Bocicor, Maria-Iuliana
    Sirbu, Adela
    Czibula, Gabriela
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1051 - 1069
  • [6] Dynamic core based clustering of gene expression data
    [J]. 1600, ICIC International (10):
  • [7] A kernel-based clustering method for gene selection with gene expression data
    Chen, Huihui
    Zhang, Yusen
    Gutman, Ivan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 12 - 20
  • [8] Incorporating gene ontology in clustering gene expression data
    Kustra, Rafal
    Zagdanski, Adam
    [J]. 19TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2006, : 555 - +
  • [9] Model-based clustering and data transformations for gene expression data
    Yeung, KY
    Fraley, C
    Murua, A
    Raftery, AE
    Ruzzo, WL
    [J]. BIOINFORMATICS, 2001, 17 (10) : 977 - 987
  • [10] Clustering analysis for gene expression data
    Chen, YD
    Ermolaeva, O
    Bittner, M
    Meltzer, P
    Trent, J
    Dougherty, ER
    Batman, S
    [J]. ADVANCES IN FLUORESCENCE SENSING TECHNOLOGY IV, PROCEEDINGS OF, 1999, 3602 : 422 - 428