A comprehensive study of clustering ensemble weighting based on cluster quality and diversity

Cited by: 60
Authors
Nazari, Ahmad [1]
Dehghan, Ayob [1]
Nejatian, Samad [2, 3]
Rezaie, Vahideh [3, 4]
Parvin, Hamid [1, 5]
Affiliations
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
[5] Islamic Azad Univ, Young Researchers & Elite Club, Nourabad Mamasani Branch, Nourabad, Mamasani, Iran
Keywords
Data clustering; Clustering ensemble; Consensus function; Weighting; COMBINING MULTIPLE CLUSTERINGS; TRANSFER DISTANCE; SELECTION; CONSENSUS; PARTITIONS
DOI
10.1007/s10044-017-0676-x
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Clustering, a major task in data mining, discovers hidden patterns in unlabeled datasets. Finding the best clustering is also considered one of the most challenging problems in data mining. Because of the complexity of the problem and the weaknesses of individual base clustering algorithms, much research has been directed toward ensemble clustering methods. Ensemble clustering aggregates a pool of base clusterings and produces an output clustering, also called the consensus clustering. The consensus clustering is usually better than the outputs of the individual base clustering algorithms. However, low-quality base clusterings weaken the resulting consensus clustering. Although some studies select a subset of high-quality base clusterings according to a clustering assessment metric, cluster-level selection has always been ignored. This paper proposes a new clustering ensemble framework based on cluster-level weighting. The reliability of a cluster is defined as the certainty that the given ensemble has about that cluster, and this certainty is computed from the accretion amount of the cluster by the ensemble. The final ensemble is then created by selecting the best clusters and assigning each selected cluster a weight based on its reliability. Next, the paper proposes a cluster-level weighting co-association matrix in place of the traditional co-association matrix. Finally, two consensus functions are introduced and used to produce the consensus partition. In the reported experiments, the proposed framework outperforms state-of-the-art clustering ensemble methods.
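The idea of a cluster-level weighted co-association matrix can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not give the reliability formula, so the weight below is a hypothetical stand-in (the mean best Jaccard match of a cluster against every other base clustering), and all function names are assumptions made for illustration. The consensus step is omitted because the abstract does not specify the two consensus functions.

```python
from itertools import combinations


def clusters_of(labels):
    """Group point indices by cluster label for one base clustering."""
    groups = {}
    for idx, lab in enumerate(labels):
        groups.setdefault(lab, set()).add(idx)
    return list(groups.values())


def jaccard(a, b):
    """Jaccard similarity between two non-empty sets of point indices."""
    return len(a & b) / len(a | b)


def cluster_weight(cluster, other_partitions):
    """Hypothetical reliability: mean best Jaccard match in the other partitions."""
    if not other_partitions:
        return 1.0
    best = [max(jaccard(cluster, c) for c in part) for part in other_partitions]
    return sum(best) / len(best)


def weighted_co_association(base_labelings):
    """Co-association matrix where each co-occurrence counts with its cluster's weight."""
    n = len(base_labelings[0])
    partitions = [clusters_of(labels) for labels in base_labelings]
    matrix = [[0.0] * n for _ in range(n)]
    for p, part in enumerate(partitions):
        others = partitions[:p] + partitions[p + 1:]
        for cluster in part:
            w = cluster_weight(cluster, others)
            for i, j in combinations(sorted(cluster), 2):
                matrix[i][j] += w
                matrix[j][i] += w
    m = len(partitions)
    for i in range(n):
        for j in range(n):
            matrix[i][j] /= m  # average over base clusterings
        matrix[i][i] = 1.0     # a point always co-occurs with itself
    return matrix


# Three toy base clusterings of six points (made-up data).
base = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 2, 2],
]
W = weighted_co_association(base)
```

In this toy ensemble, the cluster {2, 3} from the third clustering agrees poorly with the other two clusterings, so the pair (2, 3) receives a much smaller entry than the pair (0, 1), which is grouped together by well-supported clusters in all three. A traditional co-association matrix would count every co-occurrence equally.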
Pages: 133-145
Number of pages: 13
Source: Pattern Analysis and Applications, 2019, vol. 22