Kernel hierarchical gene clustering from microarray expression data

被引:40
|
作者
Qin, J
Lewis, DP
Noble, WS
机构
[1] Columbia Univ, Columbia Genome Ctr, New York, NY 10032 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
[3] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
D O I
10.1093/bioinformatics/btg288
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Unsupervised analysis of microarray gene expression data attempts to find biologically significant patterns within a given collection of expression measurements. For example, hierarchical clustering can be applied to expression profiles of genes across multiple experiments, identifying groups of genes that share similiar expression profiles. Previous work using the support vector machine supervised learning algorithm with microarray data suggests that higher-order features, such as pairwise and tertiary correlations across multiple experiments, may provide significant benefit in learning to recognize classes of co-expressed genes. Results: We describe a generalization of the hierarchical clustering algorithm that efficiently incorporates these higher-order features by using a kernel function to map the data into a high-dimensional feature space. We then evaluate the utility of the kernel hierarchical clustering algorithm using both internal and external validation. The experiments demonstrate that the kernel representation itself is insufficient to provide improved clustering performance. We conclude that mapping gene expression data into a high-dimensional feature space is only a good idea when combined with a learning algorithm, such as the support vector machine that does not suffer from the curse of dimensionality.
引用
收藏
页码:2097 / 2104
页数:8
相关论文
共 50 条
  • [31] A Hierarchical Approach for Clustering and Pattern Matching of Gene Expression Data
    Hoque, Soriful
    Istyaq, Salim
    Riaz, Md Mushir
    [J]. 2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 413 - 416
  • [32] Comparisons and validation of statistical clustering techniques for microarray gene expression data
    Datta, S
    Datta, S
    [J]. BIOINFORMATICS, 2003, 19 (04) : 459 - 466
  • [33] Clustering microarray gene expression data using enhanced harmony search
    Pandi, M.
    Premalatha, K.
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2015, 7 (05) : 296 - 306
  • [34] Beyond Gene Clustering of Microarray Data
    Zheng, W. Jim
    Mao, Linyong
    Moussa, Omar
    Yordy, John S.
    Wang, Suiquan
    Kraft, Andrew S.
    Watson, Dennis K.
    [J]. IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2009, 45 : S3 - S3
  • [35] Kernel based nonlinear dimensionality reduction for microarray gene expression data analysis
    Li, Xuehua
    Shu, Lan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7644 - 7650
  • [36] An effective fuzzy kernel clustering analysis approach for gene expression data
    Sun, Lin
    Xu, Jiucheng
    Yin, Jiaojiao
    [J]. BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1863 - S1869
  • [37] Performance assessment of kernel density clustering for gene expression profile data
    Shu, GP
    Zeng, BY
    Chen, YPP
    Smith, OH
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2003, 4 (03): : 287 - 299
  • [38] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    [J]. AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [39] Evolutionary Techniques for Hierarchical Clustering Applied to Microarray Data
    Castellanos-Garzon, Jos A.
    Miguel-Quintales, Luis A.
    [J]. 2ND INTERNATIONAL WORKSHOP ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (IWPACBB 2008), 2009, 49 : 118 - 127
  • [40] Hierarchical Signature Clustering for Time Series Microarray Data
    Koenig, Lars
    Youn, Eunseog
    [J]. SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 57 - 65