Clustering ensemble selection considering quality and diversity

被引:68
|
作者
Abbasi, Sadr-olah [1 ]
Nejatian, Samad [2 ,3 ]
Parvin, Hamid [4 ,5 ]
Rezaie, Vahideh [3 ,6 ]
Bagherifard, Karamolah [1 ,3 ]
机构
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Nourabad Mamasani Branch, Dept Comp Engn, Nourabad Mamasani, Iran
[5] Islamic Azad Univ, Nourabad Mamasani Branch, Young Researchers & Elite Club, Nourabad Mamasani, Iran
[6] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
关键词
Clustering ensemble; Stability measure; Improved stability; Evidence accumulation; Extended EAC; Co-association matrix; Cluster evaluation; COMBINING MULTIPLE CLUSTERINGS; VALIDATION; FRAMEWORK; CONSENSUS;
D O I
10.1007/s10462-018-9642-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is highly likely that there is a partition that is judged by a stability measure as a bad one while it contains one (or more) high quality cluster(s); and then it is totally neglected. So, inspiring from the evaluation of partitions, researchers turn to define measures for evaluation of clusters. Many stability measures have been proposed such as Normalized Mutual Information to validate a partition. The defined measures are based on Normalized Mutual Information. The drawback of the commonly used approach will be discussed in this paper and a criterion is proposed to assess the association between a cluster and a partition which is called Edited Normalized Mutual Information, ENMI criterion. The ENMI criterion compensates the drawback of the common Normalized Mutual Information (NMI) measure. Also, a clustering ensemble method that is based on aggregating a subset of primary clusters is proposed. The proposed method uses the Average ENMI as fitness measure to select a number of clusters. The clusters that satisfy a predefined threshold of the mentioned measure are selected to participate in the final ensemble. To combine the chosen clusters a set of consensus function methods are employed. One class of the used consensus functions is the co-association based consensus functions. Since the Evidence Accumulation Clustering, EAC, method can't derive the co-association matrix from a subset of clusters, Extended EAC, EEAC, is employed to construct the co-association matrix from the chosen subset of clusters. The second class of the used consensus functions is based on hyper graph partitioning algorithms. The other class of the used consensus functions considers the chosen clusters as a new feature space and uses a simple clustering algorithm to extract the consensus partitioning. The empirical studies show that the proposed method outperforms other well-known ensembles.
引用
收藏
页码:1311 / 1340
页数:30
相关论文
共 50 条
  • [1] Clustering ensemble selection considering quality and diversity
    Sadr-olah Abbasi
    Samad Nejatian
    Hamid Parvin
    Vahideh Rezaie
    Karamolah Bagherifard
    [J]. Artificial Intelligence Review, 2019, 52 : 1311 - 1340
  • [2] Elite fuzzy clustering ensemble based on clustering diversity and quality measures
    Bagherinia, Ali
    Minaei-Bidgoli, Behrooz
    Hossinzadeh, Mehdi
    Parvin, Hamid
    [J]. APPLIED INTELLIGENCE, 2019, 49 (05) : 1724 - 1747
  • [3] Elite fuzzy clustering ensemble based on clustering diversity and quality measures
    Ali Bagherinia
    Behrooz Minaei-Bidgoli
    Mehdi Hossinzadeh
    Hamid Parvin
    [J]. Applied Intelligence, 2019, 49 : 1724 - 1747
  • [4] Leveraging Frequency and Diversity based Ensemble Selection to Consensus Clustering
    Banerjee, Arko
    [J]. 2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 123 - 129
  • [5] Ensemble Clustering Selection by Optimization of Accuracy-Diversity Trade off
    Akyuz, Sureyya
    Otar, Buse Cisil
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [6] An Ensemble Clustering Framework Based on Hierarchical Clustering Ensemble Selection and Clusters Clustering
    Li, Wenjun
    Wang, Zikang
    Sun, Wei
    Bahrami, Sara
    [J]. CYBERNETICS AND SYSTEMS, 2023, 54 (05) : 741 - 766
  • [7] A comprehensive study of clustering ensemble weighting based on cluster quality and diversity
    Nazari, Ahmad
    Dehghan, Ayob
    Nejatian, Samad
    Rezaie, Vahideh
    Parvin, Hamid
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 133 - 145
  • [8] A comprehensive study of clustering ensemble weighting based on cluster quality and diversity
    Ahmad Nazari
    Ayob Dehghan
    Samad Nejatian
    Vahideh Rezaie
    Hamid Parvin
    [J]. Pattern Analysis and Applications, 2019, 22 : 133 - 145
  • [9] Transfer Clustering Ensemble Selection
    Shi, Yifan
    Yu, Zhiwen
    Chen, C. L. Philip
    You, Jane
    Wong, Hau-San
    Wang, Yide
    Zhang, Jun
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2872 - 2885
  • [10] A Survey: Clustering Ensemble Selection
    Min, Liu Li
    Ping, Fan Xiao
    [J]. MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 2760 - 2763