Constraint selection by committee: an ensemble approach to identifying informative constraints for semi-supervised clustering

被引:0
|
作者
Greene, Derek [1 ]
Cunningham, Padraig [1 ]
机构
[1] Univ Coll Dublin, Dublin 2, Ireland
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A number of clustering algorithms have been proposed for use in tasks where a limited degree of supervision is available. This prior knowledge is frequently provided in the form of pairwise must-link and cannot-link constraints. While the incorporation of pairwise supervision has the potential to improve clustering accuracy, the composition and cardinality of the constraint sets can significantly impact upon the level of improvement. We demonstrate that it is often possible to correctly "guess" a large number of constraints without supervision from the co-associations between pairs of objects in an ensemble of clusterings. Along the same lines, we establish that constraints based on pairs with uncertain co-associations are particularly informative, if known. An evaluation on text data shows that this provides an effective criterion for identifying constraints, leading to a reduction in the level of supervision required to direct a clustering algorithm to an accurate solution.
引用
收藏
页码:140 / +
页数:2
相关论文
共 50 条
  • [21] Combined constraint-based with metric-based in semi-supervised clustering ensemble
    Siting Wei
    Zhixin Li
    Canlong Zhang
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 1085 - 1100
  • [22] Semi-supervised hierarchical ensemble clustering based on an innovative distance metric and constraint information
    Shen, Baohua
    Jiang, Juan
    Qian, Feng
    Li, Daoguo
    Ye, Yanming
    Ahmadi, Gholamreza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [23] Combined constraint-based with metric-based in semi-supervised clustering ensemble
    Wei, Siting
    Li, Zhixin
    Zhang, Canlong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (07) : 1085 - 1100
  • [24] Semi-Supervised EEG Clustering With Multiple Constraints
    Dai, Chenglong
    Wu, Jia
    Monaghan, Jessica J. M.
    Li, Guanghui
    Peng, Hao
    Becker, Stefanie I.
    McAlpine, David
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8529 - 8544
  • [25] Semi-supervised Clustering with Pairwise and Size Constraints
    Zhang, Shaohong
    Wong, Hau-San
    Xie, Dongqing
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2450 - 2457
  • [26] Active Learning of Constraints for Semi-Supervised Clustering
    Xiong, Sicheng
    Azimi, Javad
    Fern, Xiaoli Z.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 43 - 54
  • [27] Semi-Supervised Clustering Based on Exemplars Constraints
    Wang, Sailan
    Yang, Zhenzhi
    Yang, Jin
    Wang, Hongjun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (06) : 1231 - 1241
  • [28] Semi-supervised DenPeak Clustering with Pairwise Constraints
    Ren, Yazhou
    Hu, Xiaohui
    Shi, Ke
    Yu, Guoxian
    Yao, Dezhong
    Xu, Zenglin
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 837 - 850
  • [29] On the effects of constraints in semi-supervised hierarchical clustering
    Kestler, Hans A.
    Kraus, Johann M.
    Palm, Guenther
    Schwenker, Friedhelm
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2006, 4087 : 57 - 66
  • [30] A classification-based approach to semi-supervised clustering with pairwise constraints
    Smieja, Marek
    Struski, Lukasz
    Figueiredo, Mario A. T.
    NEURAL NETWORKS, 2020, 127 : 193 - 203