A comprehensive study of clustering ensemble weighting based on cluster quality and diversity

被引:60
|
作者
Nazari, Ahmad [1 ]
Dehghan, Ayob [1 ]
Nejatian, Samad [2 ,3 ]
Rezaie, Vahideh [3 ,4 ]
Parvin, Hamid [1 ,5 ]
机构
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
[5] Islamic Azad Univ, Young Researchers & Elite Club, Nourabad Mamasani Branch, Nourabad, Mamasani, Iran
关键词
Data clustering; Clustering ensemble; Consensus function; Weighting; COMBINING MULTIPLE CLUSTERINGS; TRANSFER DISTANCE; SELECTION; CONSENSUS; PARTITIONS;
D O I
10.1007/s10044-017-0676-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering as a major task in data mining is responsible for discovering hidden patterns in unlabeled datasets. Finding the best clustering is also considered as one of the most challenging problems in data mining. Due to the problem complexity and the weaknesses of primary clustering algorithm, a large part of research has been directed toward ensemble clustering methods. Ensemble clustering aggregates a pool of base clusterings and produces an output clustering that is also named consensus clustering. The consensus clustering is usually better clustering than the output clusterings of the basic clustering algorithms. However, lack of quality in base clusterings makes their consensus clustering weak. In spite of some researches in selection of a subset of high quality base clusterings based on a clustering assessment metric, cluster-level selection has been always ignored. In this paper, a new clustering ensemble framework has been proposed based on cluster-level weighting. The certainty amount that the given ensemble has about a cluster is considered as the reliability of that cluster. The certainty amount that the given ensemble has about a cluster is computed by the accretion amount of that cluster by the ensemble. Then by selecting the best clusters and assigning a weight to each selected cluster based on its reliability, the final ensemble is created. After that, the paper proposes cluster-level weighting co-association matrix instead of traditional co-association matrix. Then, two consensus functions have been introduced and used for production of the consensus partition. The proposed framework completely overshadows the state-of-the-art clustering ensemble methods experimentally.
引用
收藏
页码:133 / 145
页数:13
相关论文
共 50 条
  • [21] Sparse dual-weighting ensemble clustering
    Xu, Pan
    Gao, Hui
    Wang, Yixuan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [22] Study on the Influence of Diversity and Quality in Entropy Based Collaborative Clustering
    Sublime, Jeremie
    Cabanes, Guenael
    Matei, Basarab
    ENTROPY, 2019, 21 (10)
  • [23] A fuzzy clustering ensemble based on cluster clustering and iterative Fusion of base clusters
    Musa Mojarad
    Samad Nejatian
    Hamid Parvin
    Majid Mohammadpoor
    Applied Intelligence, 2019, 49 : 2567 - 2581
  • [24] A fuzzy clustering ensemble based on cluster clustering and iterative Fusion of base clusters
    Mojarad, Musa
    Nejatian, Samad
    Parvin, Hamid
    Mohammadpoor, Majid
    APPLIED INTELLIGENCE, 2019, 49 (07) : 2567 - 2581
  • [25] Weighting cluster ensembles in evidence accumulation clustering
    Duarte, F. Jorge
    Fred, Ana L. N.
    Lourenco, Andre
    Rodrigues, M. Fatima
    2005 PORTUGUESE CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, : 159 - 167
  • [26] The Core Cluster-Based Subspace Weighted Clustering Ensemble
    Huang, Xuan
    Qin, Fang
    Lin, Lin
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [27] Clustering Ensemble Algorithm with Cluster Connection Based on Wisdom of Crowds
    Zhang H.
    Gao Y.
    Chen Y.
    Wang Z.
    Gao, Yukun (821566504@qq.com), 2018, Science Press (55): : 2611 - 2619
  • [28] Ensemble clustering and feature weighting in time series data
    Ainaz Bahramlou
    Massoud Reza Hashemi
    Zeinab Zali
    The Journal of Supercomputing, 2023, 79 : 16442 - 16478
  • [29] Ensemble clustering and feature weighting in time series data
    Bahramlou, Ainaz
    Hashemi, Massoud Reza
    Zali, Zeinab
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (15): : 16442 - 16478
  • [30] A Novel Ensemble Clustering Approach with Internal Weighting Strategy
    Zhao, Wenfei
    Lian, Cheng
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2521 - 2526