Cluster ensemble selection with constraints

被引:30
|
作者
Yang, Fan [1 ]
Li, Tao [2 ,4 ]
Zhou, Qifeng [1 ]
Xiao, Han [3 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
[2] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL 33199 USA
[3] Aalto Univ, Dept Comp Sci, Espoo, Finland
[4] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Cluster ensemble; Semi-supervised; Constraint; Ensemble selection; CONSENSUS;
D O I
10.1016/j.neucom.2017.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering ensemble has emerged as an important tool for data analysis, by which a more robust and accurate consensus clustering can be generated. On forming the ensembles, empirical studies have suggested that better ensembles can be obtained by simultaneously considering the quality of the ensembles and the diversity among ensemble members. However, little research efforts have been paid to incorporate prior background knowledge. In this paper, we first provide a theoretical analysis on the effect of the diversity and quality of the ensemble members. We then propose a unified framework to solve constraint-based clustering ensemble selection problem, where some instance level must-link and cannot-link constraints are given as prior knowledge or background information. We formalize this problem as a combinatorial optimization problem in terms of the consistency under the constraints, the diversity among ensemble members, and the overall quality of ensembles. Our proposed framework brings together two distinct yet interrelated themes from clustering: ensemble clustering and semi-supervised clustering. We study different techniques for searching high-quality solutions. Experiments on benchmark datasets demonstrate the effectiveness of our framework.
引用
收藏
页码:59 / 70
页数:12
相关论文
共 50 条
  • [1] Hierarchical cluster ensemble selection
    Akbari, Ebrahim
    Dahlan, Halina Mohamed
    Ibrahim, Roliana
    Alizadeh, Hosein
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 39 : 146 - 156
  • [2] Adaptive Cluster Ensemble Selection
    Azimi, Javad
    Fern, Xiaoli
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 992 - 997
  • [3] Average Cluster Consistency for Cluster Ensemble Selection
    Duarte, F. Jorge F.
    Duarte, Joao M. M.
    Fred, Ana L. N.
    Rodrigues, M. Fatima C.
    KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, 2011, 128 : 133 - +
  • [4] CLUSTER ENSEMBLE SELECTION Using Average Cluster Consistency
    Duarte, F. Jorge F.
    Duarte, Joao M. M.
    Rodrigues, M. Fatima C.
    Fred, Ana L. N.
    KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 85 - +
  • [5] A Clustering Ensemble Method Based on Cluster Selection and Cluster Splitting
    Tang, Yuyang
    Liu, Xiabi
    PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING (ICMLC 2018), 2018, : 54 - 58
  • [6] Cluster ensemble selection based on a new cluster stability measure
    Alizadeh, Hosein
    Minaei-Bidgoli, Behrouz
    Parvin, Hamid
    INTELLIGENT DATA ANALYSIS, 2014, 18 (03) : 389 - 408
  • [7] Social Network Optimization for Cluster Ensemble Selection
    Zhao, Chenyue
    Alizadeh, Hosein
    Minaei, Behrouz
    Mohamadpoor, Majid
    Parvin, Hamid
    Mahmoudi, Mohammad Reza
    FUNDAMENTA INFORMATICAE, 2020, 176 (01) : 79 - 102
  • [8] Rough Set based Cluster Ensemble Selection
    Wang, Xueen
    Han, Deqiang
    Han, Chongzhao
    2013 16TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2013, : 438 - 444
  • [9] An Improved Adaptive Cluster Ensemble Selection Approach
    Xu S.
    Gao J.
    Hua X.-P.
    Li X.-F.
    Xu J.
    Xu, Sen (xusen@ycit.cn), 2018, Science Press (44): : 2103 - 2112
  • [10] Cluster ensemble selection based on relative validity indexes
    Naldi, M. C.
    Carvalho, A. C. P. L. F.
    Campello, R. J. G. B.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 27 (02) : 259 - 289