Kernel hierarchical gene clustering from microarray expression data

被引:40
|
作者
Qin, J
Lewis, DP
Noble, WS
机构
[1] Columbia Univ, Columbia Genome Ctr, New York, NY 10032 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
[3] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
D O I
10.1093/bioinformatics/btg288
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Unsupervised analysis of microarray gene expression data attempts to find biologically significant patterns within a given collection of expression measurements. For example, hierarchical clustering can be applied to expression profiles of genes across multiple experiments, identifying groups of genes that share similiar expression profiles. Previous work using the support vector machine supervised learning algorithm with microarray data suggests that higher-order features, such as pairwise and tertiary correlations across multiple experiments, may provide significant benefit in learning to recognize classes of co-expressed genes. Results: We describe a generalization of the hierarchical clustering algorithm that efficiently incorporates these higher-order features by using a kernel function to map the data into a high-dimensional feature space. We then evaluate the utility of the kernel hierarchical clustering algorithm using both internal and external validation. The experiments demonstrate that the kernel representation itself is insufficient to provide improved clustering performance. We conclude that mapping gene expression data into a high-dimensional feature space is only a good idea when combined with a learning algorithm, such as the support vector machine that does not suffer from the curse of dimensionality.
引用
收藏
页码:2097 / 2104
页数:8
相关论文
共 50 条
  • [1] Clustering gene expression signals from retinal microarray data
    Fleury, G
    Hero, A
    Yoshida, S
    Carter, T
    Barlow, C
    Swaroop, A
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4024 - 4027
  • [2] Hierarchical clustering of gene expression data
    Luo, F
    Tang, K
    Khan, L
    [J]. THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, : 328 - 335
  • [3] A High-Speed Two Dimensional Hierarchical Clustering of Microarray Gene Expression Data
    Priscilla, R.
    Swamynathan, S.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 539 - +
  • [4] An efficient optimal leaf ordering for hierarchical clustering in microarray gene expression data analysis
    Zhang, JT
    Gruenwald, L
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 396 - 400
  • [5] Clustering methods for microarray gene expression data
    Belacel, Nabil
    Wang, Qian
    Cuperlovic-Culf, Miroslava
    [J]. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2006, 10 (04) : 507 - 531
  • [6] Gene Screening and Clustering of Yeast Microarray Gene Expression Data
    Lee, Kyunga
    Kim, Taehoun
    Kim, Jaehee
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (06) : 1077 - 1094
  • [7] Quick hierarchical biclustering on microarray gene expression data
    Ji, Liping
    Mock, Kenneth Wei-Liang
    Tan, Kian-Lee
    [J]. BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 110 - +
  • [8] Clustering of Association Rules on Microarray Gene Expression Data
    Alagukumar, S.
    Vanitha, C. Devi Arockia
    Lawrance, R.
    [J]. ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 85 - 97
  • [9] Clustering approaches to identifying gene expression patterns from DNA microarray data
    Do, Jin Hwan
    Choi, Dong-Kug
    [J]. MOLECULES AND CELLS, 2008, 25 (02) : 279 - 288
  • [10] A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data
    Priscilla, R.
    Swamynathan, S.
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (02) : 204 - 213