Constraint selection by committee: an ensemble approach to identifying informative constraints for semi-supervised clustering

被引:0
|
作者
Greene, Derek [1 ]
Cunningham, Padraig [1 ]
机构
[1] Univ Coll Dublin, Dublin 2, Ireland
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supervision is available. This prior knowledge is frequently provided in the form of pairwise must-link and cannot-link constraints. While the incorporation of pairwise supervision has the potential to improve clustering accuracy, the composition and cardinality of the constraint sets can significantly impact upon the level of improvement. We demonstrate that it is often possible to correctly "guess" a large number of constraints without supervision from the co-associations between pairs of objects in an ensemble of clusterings. Along the same lines, we establish that constraints based on pairs with uncertain co-associations are particularly informative, if known. An evaluation on text data shows that this provides an effective criterion for identifying constraints, leading to a reduction in the level of supervision required to direct a clustering algorithm to an accurate solution.
引用
收藏
页码:140 / +
页数:2
相关论文
共 50 条
  • [1] A HYBRID APPROACH TO SELECTING INFORMATIVE CONSTRAINTS FOR SEMI-SUPERVISED CLUSTERING
    Ni, Xianhua
    Yang, Yan
    UNCERTAINTY MODELING IN KNOWLEDGE ENGINEERING AND DECISION MAKING, 2012, 7 : 833 - 838
  • [2] Constraint Selection for Semi-supervised Topological Clustering
    Allab, Kais
    Benabdeslem, Khalid
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 28 - 43
  • [3] Constraint projections for semi-supervised spectral clustering ensemble
    Yang, Jingya
    Sun, Linfu
    Wu, Qishi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (20):
  • [4] Automated Constraint Selection for Semi-supervised Clustering Algorithm
    Ruiz, Carlos
    Vallejo, Carlos G.
    Spiliopoulou, Myra
    Menasalvas, Ernestina
    CURRENT TOPICS IN ARTIFICIAL INTELLIGENCE, 2010, 5988 : 151 - +
  • [5] Fuzzy Semi-supervised Clustering with Active Constraint Selection
    Novoselova, Natalia
    Tom, Igor
    PATTERN RECOGNITION AND INFORMATION PROCESSING, 2017, 673 : 132 - 139
  • [6] Semi-supervised Selective Clustering Ensemble based on constraint information
    Ma, Tinghuai
    Zhang, Zheng
    Guo, Lei
    Wang, Xin
    Qian, Yurong
    Al-Nabhan, Najla
    NEUROCOMPUTING, 2021, 462 : 412 - 425
  • [7] Semi-Supervised Ensemble Clustering Based on Selected Constraint Projection
    Yu, Zhiwen
    Luo, Peinan
    Liu, Jiming
    Wong, Hau-San
    You, Jane
    Han, Guoqiang
    Zhang, Jun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (12) : 2394 - 2407
  • [8] Exploiting constraint inconsistence for dimension selection in subspace clustering: A semi-supervised approach
    Zhang, Xianchao
    Qiu, Yang
    Wu, Yao
    NEUROCOMPUTING, 2011, 74 (17) : 3598 - 3608
  • [9] Semi-Supervised Clustering Ensemble Based on Cluster Consensus Selection
    Liu, Yanxi
    Al-Khafaji, Ali Hussein Demin
    CYBERNETICS AND SYSTEMS, 2025, 56 (03) : 213 - 241
  • [10] Semi-supervised spectral clustering ensemble
    1600, ICIC Express Letters Office (10):