An algorithm to assess the reliability of hierarchical clusters in gene expression data

Authors
Avogadri, Roberto [1]
Brioschi, Matteo [2]
Ruffino, Francesca [1]
Ferrazzi, Fulvia [3]
Beghini, Alessandro [2]
Valentini, Giorgio [1]
Affiliations
[1] Univ Milan, DSI, I-20122 Milan, Italy
[2] Univ Milan, DBioGen Dip Biol & Genet Sci Med, I-20122 Milan, Italy
[3] Univ Pavia, Dipartimento Informat & Sistemist, Pavia, Italy
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The validation of clusters discovered in bio-molecular data is a central issue in bioinformatics. Recently, stability-based methods have been successfully applied to the analysis of the reliability of clusterings characterized by a relatively low number of examples and clusters. Nevertheless, several problems in functional genomics are characterized by a very large number of examples and clusters. We present a stability-based algorithm to discover significant clusters in hierarchical clusterings with a large number of examples and clusters. Preliminary results on gene expression data of patients affected by Human Myeloid Leukemia show how to apply the proposed method when thousands of gene clusters are involved.
Pages: 764 / +
Page count: 3