An algorithm to assess the reliability of hierarchical clusters in gene expression data

被引:0
|
作者
Avogadri, Roberto [1 ]
Brioschi, Matteo [2 ]
Ruffino, Francesca [1 ]
Ferrazzi, Fulvia [3 ]
Beghini, Alessandro [2 ]
Valentini, Giorgio [1 ]
机构
[1] Univ Milan, DSI, I-20122 Milan, Italy
[2] Univ Milan, DBioGen Dip Biol & Genet Sci Med, I-20122 Milan, Italy
[3] Univ Pavia, Dipartimento Informat & Sistemist, Pavia, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The validation of clusters discovered in bio-molecular data is a central issue in bioinformatics. Recently, stability-based methods have been successfully applied to the analysis of the reliability of clusterings characterized by a relatively low number of examples and clusters. Nevertheless, several problems in functional genomics are characterized by a very large number of examples and clusters. We present a stability-based algorithm to discover significant clusters in hierarchical clusterings with a large number of examples and clusters. Preliminary results on gene expression data of patients affected by Human Myeloid Leukemia, show how to apply the proposed method when thousands of gene clusters are involved.
引用
收藏
页码:764 / +
页数:3
相关论文
共 50 条
  • [31] A Novel Approach for Discovering Overlapping Clusters in Gene Expression Data
    Ma, Patrick C. H.
    Chan, Keith C. C.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (07) : 1803 - 1809
  • [32] Ontologizing gene-expression microarray data:: characterizing clusters with Gene Ontology
    Robinson, PN
    Wollstein, A
    Böhme, U
    Beattie, B
    [J]. BIOINFORMATICS, 2004, 20 (06) : 979 - 981
  • [33] Identifying time-lagged gene clusters using gene expression data
    Ji, LP
    Tan, KL
    [J]. BIOINFORMATICS, 2005, 21 (04) : 509 - 516
  • [34] Using degradation data to assess reliability
    Hamada, Michael
    [J]. Quality Engineering, 2005, 17 (04) : 615 - 620
  • [35] Probabilistic estimation of microarray data reliability and underlying gene expression
    Sven Bilke
    Thomas Breslin
    Mikael Sigvardsson
    [J]. BMC Bioinformatics, 4
  • [36] Probabilistic estimation of microarray data reliability and underlying gene expression
    Bilke, S
    Breslin, T
    Sigvardsson, M
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [37] A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
    Yan Zhang
    Weiyu Shi
    Yeqing Sun
    [J]. BMC Genomics, 24
  • [38] A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
    Zhang, Yan
    Shi, Weiyu
    Sun, Yeqing
    [J]. BMC GENOMICS, 2023, 24 (01)
  • [39] An Efficient Weighted Biclustering Algorithm for Gene Expression Data
    Jia, Yankun
    Li, Yidong
    Liu, Wenhua
    Dong, Hairong
    [J]. 2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 336 - 341
  • [40] CIS: A nonparametric clustering algorithm for gene expression data
    Zhao, YH
    Yin, Y
    Wang, GR
    Mao, KM
    [J]. PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 651 - 656