On constructing an optimal consensus clustering from multiple clusterings

Cited by: 7
Authors
Berman, Piotr [1]
DasGupta, Bhaskar
Kao, Ming-Yang
Wang, Jie
Affiliations
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[3] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[4] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
Funding
National Natural Science Foundation of China; US National Science Foundation
Keywords
computational complexity; approximation algorithms; consensus clustering
DOI
10.1016/j.ipl.2007.06.008
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Computing a suitable measure of consensus among several clusterings on the same data is an important problem that arises in several areas such as computational biology and data mining. In this paper, we formalize a set-theoretic model for computing such a similarity measure. Roughly speaking, in this model we have k > 1 partitions (clusters) of the same data set, each containing the same number of sets, and the goal is to align the sets in each partition to minimize a similarity measure. For k = 2, a polynomial-time solution was proposed by Gusfield (Information Processing Letters 82 (2002) 159-164). In this paper, we show that the problem is MAX-SNP-hard for k = 3 even if each partition in each cluster contains no more than 2 elements, and provide a (2 - 2/k)-approximation algorithm for the problem for any k. © 2007 Elsevier B.V. All rights reserved.
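To make the alignment idea concrete, here is a minimal Python sketch for the k = 2 case. It is an illustration only and is not taken from the paper: the abstract does not spell out the measure, so the sketch assumes the cost of an alignment is the total size of the symmetric differences between paired sets, and the names `alignment_cost` and `best_alignment` are hypothetical. It also tries every permutation, which is only practical for a handful of sets and is not the polynomial-time method attributed to Gusfield.

```python
from itertools import permutations


def alignment_cost(p, q, perm):
    """Assumed cost: total symmetric-difference size when p[i] is paired with q[perm[i]]."""
    return sum(len(p[i] ^ q[perm[i]]) for i in range(len(p)))


def best_alignment(p, q):
    """Brute-force search over all pairings of the sets in p with the sets in q.

    Illustrative only: runs in factorial time in the number of sets.
    """
    if len(p) != len(q):
        raise ValueError("both partitions must contain the same number of sets")
    best = min(permutations(range(len(q))),
               key=lambda perm: alignment_cost(p, q, perm))
    return best, alignment_cost(p, q, best)


# Two partitions of {1, ..., 6}, each with three sets.
p = [{1, 2, 3}, {4, 5}, {6}]
q = [{4, 5, 6}, {1, 2}, {3}]
perm, cost = best_alignment(p, q)
print(perm, cost)  # (1, 0, 2) 4
```

Under this assumed cost, finding the best pairing of two partitions is an assignment problem, so a polynomial-time version could replace the permutation search with an assignment solver. For k = 3 and above, the abstract states the problem is MAX-SNP-hard, so exact search of this kind does not scale.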
Pages: 137-145
Number of pages: 9
Related papers
50 in total
  • [1] Using Soft Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    JURNAL TEKNOLOGI, 2013, 63 (01):
  • [2] Graph-Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    MOLECULAR INFORMATICS, 2013, 32 (02) : 165 - 178
  • [3] Voting-based consensus clustering for combining multiple clusterings of chemical structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    JOURNAL OF CHEMINFORMATICS, 2012, 4
  • [4] Voting-based consensus clustering for combining multiple clusterings of chemical structures
    Faisal Saeed
    Naomie Salim
    Ammar Abdo
    Journal of Cheminformatics, 4
  • [5] Information Theory and Voting Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    MOLECULAR INFORMATICS, 2013, 32 (07) : 591 - 598
  • [6] Unsupervised collaborative boosting of clustering: an unifying framework for multi-view clustering, multiple consensus clusterings and alternative clustering
    Sublemontier, Jacques-Henri
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [7] Consensus clusterings
    Nguyen, Nam
    Caruana, Rich
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 607 - 612
  • [8] Consensus Methods for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (05) : 1026 - 1034
  • [9] Clustering trees: a visualization for evaluating clusterings at multiple resolutions
    Zappia, Luke
    Oshlack, Alicia
    GIGASCIENCE, 2018, 7 (07):
  • [10] Implicit consensus clustering from multiple graphs
    Rafika Boutalbi
    Lazhar Labiod
    Mohamed Nadif
    Data Mining and Knowledge Discovery, 2021, 35 : 2313 - 2340