On constructing an optimal consensus clustering from multiple clusterings

Cited by: 7
Authors
Berman, Piotr [1]
DasGupta, Bhaskar
Kao, Ming-Yang
Wang, Jie
Affiliations
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
[2] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[3] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[4] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
Funding
National Natural Science Foundation of China; US National Science Foundation
Keywords
computational complexity; approximation algorithms; consensus clustering
DOI
10.1016/j.ipl.2007.06.008
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Computing a suitable measure of consensus among several clusterings of the same data is an important problem that arises in several areas, such as computational biology and data mining. In this paper, we formalize a set-theoretic model for computing such a similarity measure. Roughly speaking, in this model we have k > 1 partitions (clusterings) of the same data set, each containing the same number of sets, and the goal is to align the sets in each partition to minimize a similarity measure. For k = 2, a polynomial-time solution was proposed by Gusfield (Information Processing Letters 82 (2002) 159-164). We show that the problem is MAX-SNP-hard for k = 3, even if each set in each partition contains no more than 2 elements, and provide a (2 - 2/k)-approximation algorithm for the problem for any k. (c) 2007 Elsevier B.V. All rights reserved.
Pages: 137-145
Number of pages: 9
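To make the alignment problem in the abstract concrete, here is a minimal sketch for the k = 2 case. It assumes the cost of pairing two sets is the size of their symmetric difference; the abstract does not state the exact measure, so that choice is an illustration only. The function names `alignment_cost` and `best_alignment` are hypothetical, and the exhaustive search over pairings is a stand-in for readability, not the polynomial-time method attributed to Gusfield.

```python
from itertools import permutations


def alignment_cost(a, b, perm):
    """Total symmetric-difference size when set a[j] is paired with b[perm[j]].

    Assumption: symmetric difference is used as the per-pair cost.
    """
    return sum(len(a[j] ^ b[perm[j]]) for j in range(len(a)))


def best_alignment(a, b):
    """Try every pairing of the sets of two partitions and keep the cheapest.

    This is brute force (factorial time), suitable only for tiny inputs.
    """
    if len(a) != len(b):
        raise ValueError("both partitions must contain the same number of sets")
    candidates = (
        (alignment_cost(a, b, perm), perm)
        for perm in permutations(range(len(b)))
    )
    return min(candidates, key=lambda pair: pair[0])


# Two partitions of {1, ..., 6}, each with three sets of at most 2 elements.
partition_a = [{1, 2}, {3, 4}, {5, 6}]
partition_b = [{3, 4}, {1, 5}, {2, 6}]

cost, perm = best_alignment(partition_a, partition_b)
print(cost, perm)
```

For two partitions, the same minimization can be posed as an assignment problem over the matrix of pairwise costs, which is why a polynomial-time solution is plausible for k = 2; the hardness result in the abstract concerns k = 3 and beyond.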