Cluster ensemble selection with constraints

被引:30
|
作者
Yang, Fan [1 ]
Li, Tao [2 ,4 ]
Zhou, Qifeng [1 ]
Xiao, Han [3 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
[2] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
[3] Aalto Univ, Dept Comp Sci, Espoo, Finland
[4] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Cluster ensemble; Semi-supervised; Constraint; Ensemble selection; CONSENSUS;
D O I
10.1016/j.neucom.2017.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering ensemble has emerged as an important tool for data analysis, by which a more robust and accurate consensus clustering can be generated. On forming the ensembles, empirical studies have suggested that better ensembles can be obtained by simultaneously considering the quality of the ensembles and the diversity among ensemble members. However, little research efforts have been paid to incorporate prior background knowledge. In this paper, we first provide a theoretical analysis on the effect of the diversity and quality of the ensemble members. We then propose a unified framework to solve constraint-based clustering ensemble selection problem, where some instance level must-link and cannot-link constraints are given as prior knowledge or background information. We formalize this problem as a combinatorial optimization problem in terms of the consistency under the constraints, the diversity among ensemble members, and the overall quality of ensembles. Our proposed framework brings together two distinct yet interrelated themes from clustering: ensemble clustering and semi-supervised clustering. We study different techniques for searching high-quality solutions. Experiments on benchmark datasets demonstrate the effectiveness of our framework.
引用
收藏
页码:59 / 70
页数:12
相关论文
共 50 条
  • [31] Ensemble selection by GRASP
    Liu, Zhuan
    Dai, Qun
    Liu, Ningzhong
    APPLIED INTELLIGENCE, 2014, 41 (01) : 128 - 144
  • [32] Bagging Ensemble Selection
    Sun, Quan
    Pfahringer, Bernhard
    AI 2011: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7106 : 251 - 260
  • [33] SELECTION WITH CONSTRAINTS
    HARVILLE, DA
    REEVES, TF
    ALLAIRE, FR
    JOURNAL OF ANIMAL SCIENCE, 1972, 35 (01) : 183 - &
  • [34] Weighted Spectral Cluster Ensemble
    Yousefnezhad, Muhammad
    Zhang, Daoqiang
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 549 - 558
  • [35] Probabilistic cluster structure ensemble
    Yu, Zhiwen
    Li, Le
    Wong, Hau-San
    You, Jane
    Han, Guoqiang
    Gao, Yunjun
    Yu, Guoxian
    INFORMATION SCIENCES, 2014, 267 : 16 - 34
  • [36] Knowledge based Cluster Ensemble
    Yu, Zhiwen
    Deng, Zhongkai
    Wong, Hau-San
    Wang, Xing
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 589 - 594
  • [37] Wisdom of Crowds cluster ensemble
    Alizadeh, Hosein
    Yousefnezhad, Muhammad
    Bidgoli, Behrouz Minaei
    INTELLIGENT DATA ANALYSIS, 2015, 19 (03) : 485 - 503
  • [38] Cluster ensemble Kalman filter
    Smith, Keston W.
    TELLUS SERIES A-DYNAMIC METEOROLOGY AND OCEANOGRAPHY, 2007, 59 (05) : 749 - 757
  • [39] Cluster Expansion in the Canonical Ensemble
    Pulvirenti, Elena
    Tsagkarogiannis, Dimitrios
    COMMUNICATIONS IN MATHEMATICAL PHYSICS, 2012, 316 (02) : 289 - 306
  • [40] Cluster Expansion in the Canonical Ensemble
    Elena Pulvirenti
    Dimitrios Tsagkarogiannis
    Communications in Mathematical Physics, 2012, 316 : 289 - 306