Comparing the performance of biomedical clustering methods

被引:0
|
作者
Wiwie, Christian [1 ]
Baumbach, Jan [1 ,2 ,3 ]
Rottger, Richard [1 ]
机构
[1] Univ Southern Denmark, Dept Math & Comp Sci, Odense, Denmark
[2] Max Planck Inst Informat, Computat Syst Biol, D-66123 Saarbrucken, Germany
[3] Univ Saarland, Cluster Excellence Multimodal Comp & Interact, D-66123 Saarbrucken, Germany
关键词
PROTEIN-INTERACTION NETWORKS; GENE-EXPRESSION DATA; MICROARRAY DATA; AUTOMATED-METHOD; ALGORITHMS; COMPLEXES; DISCOVERY; DATABASE; MODEL;
D O I
10.1038/NMETH.3583
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying groups of similar objects is a popular first step in biomedical data analysis, but it is error-prone and impossible to perform manually. Many computational methods have been developed to tackle this problem. Here we assessed 13 well-known methods using 24 data sets ranging from gene expression to protein domains. Performance was judged on the basis of 13 common cluster validity indices. We developed a clustering analysis platform, ClustEval (http://clusteval.mpi-inf.mpg.de), to promote streamlined evaluation, comparison and reproducibility of clustering results in the future. This allowed us to objectively evaluate the performance of all tools on all data sets with up to 1,000 different parameter sets each, resulting in a total of more than 4 million calculated cluster validity indices. We observed that there was no universal best performer, but on the basis of this wide-ranging comparison we were able to develop a short guideline for biomedical clustering tasks. ClustEval allows biomedical researchers to pick the appropriate tool for their data type and allows method developers to compare their tool to the state of the art.
引用
收藏
页码:1033 / 1038
页数:6
相关论文
共 50 条
  • [31] 'Combining Hypoxic Methods for Peak Performance': a Biomedical Engineering Perspective
    Bassovitch, Oleg
    SPORTS MEDICINE, 2010, 40 (06) : 519 - 521
  • [32] ‘Combining Hypoxic Methods for Peak Performance’: a Biomedical Engineering Perspective
    Oleg Bassovitch
    Sports Medicine, 2010, 40 : 519 - 521
  • [33] Guiding biomedical clustering with ClustEval
    Christian Wiwie
    Jan Baumbach
    Richard Röttger
    Nature Protocols, 2018, 13 : 1429 - 1444
  • [34] Guiding biomedical clustering with ClustEval
    Wiwie, Christian
    Baumbach, Jan
    Rottger, Richard
    NATURE PROTOCOLS, 2018, 13 (06) : 1429 - 1444
  • [35] Clustering of Biomedical Scientific Papers
    Bravo-Alcobendas, D.
    Sorzano, C. O. S.
    WISP 2009: 6TH IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING, PROCEEDINGS, 2009, : 205 - 209
  • [36] The Performance of Different Clustering Methods in the Objective Assessment of Fabric Pilling
    Saharkhiz, Siamak
    Abdorazaghi, Mohammad
    JOURNAL OF ENGINEERED FIBERS AND FABRICS, 2012, 7 (04): : 35 - 41
  • [37] PERFORMANCE ANALYSIS OF ENTROPY METHODS ON K MEANS IN CLUSTERING PROCESS
    Lubis, Mhd Dicky Syahputra
    Mawengkang, Herman
    Suwilo, Saib
    INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICONICT), 2017, 930
  • [38] Comparing Teaching Methods to Optimize Student Performance in the Horticulture Classroom
    Maynard, Rebekah C. I.
    Ruter, John
    Campbell, Julie
    HORTSCIENCE, 2021, 56 (09) : S235 - S235
  • [39] Comparing performance of modern genotype imputation methods in different ethnicities
    Nab Raj Roshyara
    Katrin Horn
    Holger Kirsten
    Peter Ahnert
    Markus Scholz
    Scientific Reports, 6
  • [40] Performance Analysis of Unsupervised Clustering Methods for Brain Tumor Segmentation
    Jaware, Tushar H.
    Khanchandani, K. B.
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2013, 4 (1-4): : 55 - 59