Clustering of gene expression data: Performance and similarity analysis

被引:0
|
作者
Yin, Longde [1 ]
Huang, Chun-Hsi [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
关键词
clustering algorithms; gene expression; microarray; cluster similarity analysis; performance study;
D O I
10.1109/IMSCCS.2006.43
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent advances of the DNA Microarray technology allow monitoring gene expression profiles of thousands of genes simultaneously. However, the analysis and handling of such fast growing data is becoming the major bottleneck in the utilization of the technology. Clustering analysis is one of the most effective methods for analyzing such gene expression data. In this paper we first experimentally study three major clustering algorithms: Hierarchical Clustering, Self-Organizing Map (SOM), and Self Organizing Tree Algorithm (SOTA), using Yeast Saccharomyces cerevisiae gene expression data, and compare their performance. Then, we present a data mining tool, Cluster Diff, which allows the similarity analysis of clusters generated by different algorithms. A case study is conducted based on clusters generated by SOTA and SOM.
引用
收藏
页码:142 / +
页数:3
相关论文
共 50 条
  • [1] Clustering of gene expression data: performance and similarity analysis
    Longde Yin
    Chun-Hsi Huang
    Jun Ni
    BMC Bioinformatics, 7
  • [2] Clustering of gene expression data: performance and similarity analysis
    Yin, Longde
    Huang, Chun-Hsi
    Ni, Jun
    BMC BIOINFORMATICS, 2006, 7 (Suppl 4)
  • [3] A Similarity Measure for Clustering Gene Expression Data
    Baishya, Ram Charan
    Sarmah, Rosy
    Bhattacharyya, Dhruba Kumar
    Dutta, Malay Ananda
    APPLIED ALGORITHMS, 2014, 8321 : 245 - 256
  • [4] Clustering of Gene Expression Data Based on Shape Similarity
    Hestilow, Travis J.
    Huang, Yufei
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2009, (01)
  • [5] Application of a New Similarity Measure in Clustering Gene Expression Data
    Li, Gangguo
    Wang, Zhengzhi
    Ni, Qingshan
    Wang, Xiaomin
    Qiang, Bo
    Han Qing-juan
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 649 - +
  • [6] Gene expression data clustering based on local similarity combination
    Pan, D
    Wang, F
    PROCEEDINGS OF THE 4TH ASIA-PACIFIC BIOINFORMATICS CONFERENCE, 2006, 3 : 353 - 362
  • [7] Clustering analysis for gene expression data
    Chen, YD
    Ermolaeva, O
    Bittner, M
    Meltzer, P
    Trent, J
    Dougherty, ER
    Batman, S
    ADVANCES IN FLUORESCENCE SENSING TECHNOLOGY IV, PROCEEDINGS OF, 1999, 3602 : 422 - 428
  • [8] Analysis of gene expression data: clustering and beyond
    Zohar Yakhini
    Amir Ben-Dor
    Stuart Kim
    Ron Shamir
    Nature Genetics, 1999, 23 (Suppl 3) : 83 - 83
  • [9] Clustering gene expression data:: an experimental analysis
    Ortiz-Gama, S
    Sucar, LE
    Rodríguez, AF
    PROCEEDINGS OF THE FIFTH MEXICAN INTERNATIONAL CONFERENCE IN COMPUTER SCIENCE (ENC 2004), 2004, : 168 - 175
  • [10] Ensemble clustering method based on the resampling similarity measure for gene expression data
    Kim, Seo Young
    Lee, Jae Won
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2007, 16 (06) : 539 - 564