Detecting clusters of different geometrical shapes in microarray gene expression data

被引:32
|
作者
Kim, DW
Lee, KH
Lee, D
机构
[1] Korea Adv Inst Sci & Technol, Dept BioSyst, Taejon 305701, South Korea
[2] Korea Adv Inst Sci & Technol, Adv Informat Technol Res Ctr, Taejon 305701, South Korea
[3] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Taejon 305701, South Korea
关键词
D O I
10.1093/bioinformatics/bti251
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Clustering has been used as a popular technique for finding groups of genes that show similar expression patterns under multiple experimental conditions. Many clustering methods have been proposed for clustering gene-expression data, including the hierarchical clustering, k-means clustering and self-organizing map (SOM). However, the conventional methods are limited to identify different shapes of clusters because they use a fixed distance norm when calculating the distance between genes. The fixed distance norm imposes a fixed geometrical shape on the clusters regardless of the actual data distribution. Thus, different distance norms are required for handling the different shapes of clusters. Results: We present the Gustafson-Kessel (GK) clustering method for microarray gene-expression data. To detect clusters of different shapes in a dataset, we use an adaptive distance norm that is calculated by a fuzzy covariance matrix (F) of each cluster in which the eigenstructure of F is used as an indicator of the shape of the cluster. Moreover, the GK method is less prone to falling into local minima than the k-means and SOM because it makes decisions through the use of membership degrees of a gene to clusters. The algorithmic procedure is accomplished by the alternating optimization technique, which iteratively improves a sequence of sets of clusters until no further improvement is possible. To test the performance of the GK method, we applied the GK method and well-known conventional methods to three recently published yeast datasets, and compared the performance of each method using the Saccharomyces Genome Database annotations. The clustering results of the GK method are more significantly relevant to the biological annotations than those of the other methods, demonstrating its effectiveness and potential for clustering gene-expression data.
引用
收藏
页码:1927 / 1934
页数:8
相关论文
共 50 条
  • [41] Statistical Quality Control of Microarray Gene Expression Data
    Lu, Shen
    Segall, Richard S.
    [J]. WMSCI 2011: 15TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, 2011, : 206 - 211
  • [42] Bayesian models for gene expression with DNA microarray data
    Ibrahim, JG
    Chen, MH
    Gray, RJ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 88 - 99
  • [43] Covariance structure models for gene expression microarray data
    Xie, J
    Bentler, PM
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2003, 10 (04) : 566 - 582
  • [44] AVA: visual analysis of gene expression microarray data
    Zhou, YH
    Liu, JD
    [J]. BIOINFORMATICS, 2003, 19 (02) : 293 - 294
  • [45] An efficient approach for classification of gene expression microarray data
    Sreepada, Rama Syamala
    Vipsita, Swati
    Mohapatra, Puspanjali
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 344 - 348
  • [46] Differential analysis of DNA microarray gene expression data
    Hatfield, GW
    Hung, SP
    Baldi, P
    [J]. MOLECULAR MICROBIOLOGY, 2003, 47 (04) : 871 - 877
  • [47] The Impact of Gene Selection on Imbalanced Microarray Expression Data
    Kamal, Abu H. M.
    Zhu, Xingquan
    Pandya, Abhijit S.
    Hsu, Sam
    Shoaib, Muhammad
    [J]. BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 259 - 269
  • [48] Gene expression (microarray) data analysis by chemometric methods
    Zhu, David X.
    Goeke, Richard J.
    Baker, David L.
    Hamburg, James H.
    Booth, David E.
    Booth, Stephane E.
    [J]. CURRENT ANALYTICAL CHEMISTRY, 2007, 3 (03) : 233 - 237
  • [49] Assessing the Evolution of Gene Expression Using Microarray Data
    Woody, Owen Z.
    Doxey, Andrew C.
    McConkey, Brendan J.
    [J]. EVOLUTIONARY BIOINFORMATICS, 2008, 4 : 139 - 152
  • [50] Assessing sources of variability in microarray gene expression data
    Spruill, SE
    Lu, J
    Hardy, S
    Weir, B
    [J]. BIOTECHNIQUES, 2002, 33 (04) : 916 - +