Semi-Supervised Consensus Clustering: Reducing Human Effort

被引:1
|
作者
Vogel, Tobias [1 ]
Naumann, Felix [1 ]
机构
[1] Hasso Plattner Inst, Potsdam, Germany
关键词
D O I
10.1109/ICDMW.2014.97
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine-based clustering yields fuzzy results. For example, when detecting duplicates in a dataset, different tools might end up with different clusterings. Eventually, a decision needs to be made, defining which records are in the same cluster, i.e., are duplicates. Such a definitive result is called a Consensus Clustering and can be created by evaluating the clustering attempts against each other and only resolving the disagreements by human experts. Yet, there can be different consensus clusterings, depending on the choice of disagreements presented to the human expert. In particular, they may require a different number of manual inspections. We present a set of strategies to select the smallest set of manual inspections to arrive at a consensus clustering and evaluate their efficiency on a set of real-world and synthetic datasets.
引用
下载
收藏
页码:1095 / 1104
页数:10
相关论文
共 50 条
  • [31] Semi-supervised deep density clustering
    Xu, Xiao
    Hou, Haiwei
    Ding, Shifei
    APPLIED SOFT COMPUTING, 2023, 148
  • [32] Composite kernels for semi-supervised clustering
    Carlotta Domeniconi
    Jing Peng
    Bojun Yan
    Knowledge and Information Systems, 2011, 28 : 99 - 116
  • [33] Semi-supervised Linear Discriminant Clustering
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Lee, Chia-Hoang
    Gou, Fu-Sheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (07) : 989 - 1000
  • [34] SemiSync: Semi-supervised Clustering by Synchronization
    Zhang, Zhong
    Kang, Didi
    Gao, Chongming
    Shao, Junming
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 358 - 362
  • [35] A SUPERVISORY APPROACH TO SEMI-SUPERVISED CLUSTERING
    Conroy, Bryan
    Xi, Yongxin Taylor
    Ramadge, Peter
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1858 - 1861
  • [36] Weighted Semi-supervised Fuzzy Clustering
    Kong, Yi-qing
    Wang, Shi-tong
    FUZZY INFORMATION AND ENGINEERING, VOL 1, 2009, 54 : 465 - 470
  • [37] Categorization Using Semi-Supervised Clustering
    Hu, Jianying
    Singh, Moninder
    Mojsilovic, Aleksandra
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3666 - 3669
  • [38] Semi-supervised deep embedded clustering
    Ren, Yazhou
    Hu, Kangrong
    Dai, Xinyi
    Pan, Lili
    Hoi, Steven C. H.
    Xu, Zenglin
    NEUROCOMPUTING, 2019, 325 : 121 - 130
  • [39] Semi-supervised point prototype clustering
    Bensaid, AM
    Bezdek, JC
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1998, 12 (05) : 625 - 643
  • [40] FISHERVOICE AND SEMI-SUPERVISED SPEAKER CLUSTERING
    Chu, Stephen M.
    Tang, Hao
    Huang, Thomas S.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4089 - +