Semi-Supervised Consensus Clustering: Reducing Human Effort

被引:1
|
作者
Vogel, Tobias [1 ]
Naumann, Felix [1 ]
机构
[1] Hasso Plattner Inst, Potsdam, Germany
关键词
D O I
10.1109/ICDMW.2014.97
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine-based clustering yields fuzzy results. For example, when detecting duplicates in a dataset, different tools might end up with different clusterings. Eventually, a decision needs to be made, defining which records are in the same cluster, i.e., are duplicates. Such a definitive result is called a Consensus Clustering and can be created by evaluating the clustering attempts against each other and only resolving the disagreements by human experts. Yet, there can be different consensus clusterings, depending on the choice of disagreements presented to the human expert. In particular, they may require a different number of manual inspections. We present a set of strategies to select the smallest set of manual inspections to arrive at a consensus clustering and evaluate their efficiency on a set of real-world and synthetic datasets.
引用
下载
收藏
页码:1095 / 1104
页数:10
相关论文
共 50 条
  • [21] Fast semi-supervised evidential clustering
    Antoine, Violaine
    Guerrero, Jose A.
    Xie, Jiarui
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 133 (133) : 116 - 132
  • [22] Semi-supervised Power Iteration Clustering
    Yang, Yuqi
    Bie, Rongfang
    Wu, Hao
    Xu, Shuaijing
    Li, Liangchi
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 588 - 595
  • [23] Semi-Supervised Clustering with Neural Networks
    Shukla, Ankita
    Cheema, Gullal S.
    Anand, Saket
    2020 IEEE SIXTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2020), 2020, : 152 - 161
  • [24] Evolutionary semi-supervised fuzzy clustering
    Liu, H
    Huang, ST
    PATTERN RECOGNITION LETTERS, 2003, 24 (16) : 3105 - 3113
  • [25] A Semi-supervised Clustering for Incomplete Data
    Goel, Sonia
    Tushir, Meena
    APPLICATIONS OF ARTIFICIAL INTELLIGENCE TECHNIQUES IN ENGINEERING, SIGMA 2018, VOL 1, 2019, 698 : 323 - 331
  • [26] Active semi-supervised fuzzy clustering
    Grira, Nizar
    Crucianu, Michel
    Boujemaa, Nozha
    PATTERN RECOGNITION, 2008, 41 (05) : 1834 - 1844
  • [27] Semi-supervised hierarchical clustering algorithms
    Amar, A
    Labzour, NT
    Bensaid, A
    SIXTH SCANDINAVIAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 1997, 40 : 232 - 239
  • [28] Input validation for semi-supervised clustering
    Yip, Kevin Y.
    Ng, Michael K.
    Cheung, David W.
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 479 - 483
  • [29] Research Progress on Semi-Supervised Clustering
    Qin, Yue
    Ding, Shifei
    Wang, Lijuan
    Wang, Yanru
    COGNITIVE COMPUTATION, 2019, 11 (05) : 599 - 612
  • [30] A survey on semi-supervised graph clustering
    Daneshfar, Fatemeh
    Soleymanbaigi, Sayvan
    Yamini, Pedram
    Amini, Mohammad Sadra
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133 (133)