An optimal hierarchical clustering algorithm for gene expression data

Cited by: 12
Authors
Seal, S [1]
Komarina, S [1]
Aluru, S [1]
Affiliations
[1] Iowa State Univ Sci & Technol, Dept Elect & Comp Engn, Ames, IA 50011 USA
Keywords
algorithms; computational geometry; gene expression; hierarchical clustering; microarrays
DOI
10.1016/j.ipl.2004.11.001
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Microarrays are used to measure the expression levels of thousands of genes simultaneously. Clustering algorithms are applied to gene expression data to find co-regulated genes. A commonly used clustering strategy is the Pearson correlation coefficient based hierarchical clustering algorithm presented in [Proc. Nat. Acad. Sci. 95 (25) (1998) 14863-14868], which takes O(N^3) time. We note that this run time can be reduced to O(N^2) by applying known hierarchical clustering algorithms [Proc. 9th Annual ACM-SIAM Symposium on Discrete Algorithms, 1998, pp. 619-628] to this problem. In this paper, we present an algorithm that runs in O(N log N) time using a geometrical reduction, and we show that it is optimal. (C) 2004 Elsevier B.V. All rights reserved.
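The abstract does not spell out the geometrical reduction, so the following is only a hedged illustration of why a Pearson-correlation problem can be viewed geometrically, not the paper's algorithm. It checks a standard identity: once each expression profile is centered to zero mean and scaled to unit norm, the squared Euclidean distance between two profiles equals 2(1 - r), where r is their Pearson correlation. The function names and the sample profiles `gene_a` and `gene_b` are made up for this sketch.

```python
import math


def standardize(profile):
    """Center a profile to zero mean and scale it to unit Euclidean norm.

    Assumes the profile is not constant (a constant profile has zero norm
    after centering, and its Pearson correlation is undefined).
    """
    mean = sum(profile) / len(profile)
    centered = [x - mean for x in profile]
    norm = math.sqrt(sum(x * x for x in centered))
    return [x / norm for x in centered]


def pearson(a, b):
    """Pearson correlation: the dot product of the standardized profiles."""
    za, zb = standardize(a), standardize(b)
    return sum(x * y for x, y in zip(za, zb))


def squared_distance(a, b):
    """Squared Euclidean distance between the standardized profiles."""
    za, zb = standardize(a), standardize(b)
    return sum((x - y) ** 2 for x, y in zip(za, zb))


# Hypothetical expression profiles, invented for illustration only.
gene_a = [1.0, 2.0, 3.0, 5.0]
gene_b = [2.0, 1.0, 4.0, 3.0]

r = pearson(gene_a, gene_b)
d2 = squared_distance(gene_a, gene_b)

# High correlation corresponds to small distance: d^2 = 2 * (1 - r).
assert math.isclose(d2, 2 * (1 - r))
```

Under this identity, ranking pairs by decreasing correlation is the same as ranking them by increasing distance between standardized profiles, which is what allows geometric nearest-neighbor techniques to be considered at all.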
Pages: 143-147
Page count: 5
Related papers
50 in total
  • [1] Hierarchical clustering of gene expression data
    Luo, F
    Tang, K
    Khan, L
    [J]. THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003: 328-335
  • [2] DENCH: A density-based hierarchical clustering algorithm for gene expression data
    Sun Liang
    Zhao Fang
    Wang Yongji
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2007, 16 (01): 24-29
  • [3] An efficient optimal leaf ordering for hierarchical clustering in microarray gene expression data analysis
    Zhang, JT
    Gruenwald, L
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004: 396-400
  • [4] Hierarchical Clustering of Gene Expression Data with Divergence Measure
    Liu, Weixiang
    Wang, Tianfu
    Chen, Siping
    Tang, Aifa
    [J]. 2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009: 305-+
  • [5] A repulsive clustering algorithm for gene expression data
    Cheng, CS
    Wang, SS
    [J]. THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003: 407-412
  • [6] The Clustering Algorithm Study of Gene Expression Data
    He Rui
    Lin Chunmei
    [J]. ENVIRONMENTAL BIOTECHNOLOGY AND MATERIALS ENGINEERING, PTS 1-3, 2011, 183-185: 93-+
  • [7] An improved algorithm for clustering gene expression data
    Bandyopadhyay, Sanghamitra
    Mukhopadhyay, Anirban
    Maulik, Ujjwal
    [J]. BIOINFORMATICS, 2007, 23 (21): 2859-2865
  • [8] Clustering Gene Expression Data Through Modified Agglomerative M-CURE Hierarchical Algorithm
    Kavitha, E.
    Tamilarasan, R.
    Poonguzhali, N.
    Kannan, M. K. Jayanthi
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (03): 1027-1041
  • [9] Gene expression data clustering and visualization based on a binary hierarchical clustering framework
    Szeto, LK
    Liew, AWC
    Yan, H
    Tang, SS
    [J]. JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2003, 14 (04): 341-362
  • [10] Biologically supervised hierarchical clustering algorithms for gene expression data
    Boratyn, Grzegorz M.
    Datta, Susmita
    Datta, Somnath
    [J]. 2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006: 5681-+