A comprehensive study of clustering ensemble weighting based on cluster quality and diversity

Cited by: 60
Authors
Nazari, Ahmad [1]
Dehghan, Ayob [1]
Nejatian, Samad [2, 3]
Rezaie, Vahideh [3, 4]
Parvin, Hamid [1, 5]
Affiliations
[1] Islamic Azad Univ, Yasooj Branch, Dept Comp Engn, Yasuj, Iran
[2] Islamic Azad Univ, Yasooj Branch, Dept Elect Engn, Yasuj, Iran
[3] Islamic Azad Univ, Yasooj Branch, Young Researchers & Elite Club, Yasuj, Iran
[4] Islamic Azad Univ, Yasooj Branch, Dept Math, Yasuj, Iran
[5] Islamic Azad Univ, Young Researchers & Elite Club, Nourabad Mamasani Branch, Nourabad, Mamasani, Iran
Keywords
Data clustering; Clustering ensemble; Consensus function; Weighting; COMBINING MULTIPLE CLUSTERINGS; TRANSFER DISTANCE; SELECTION; CONSENSUS; PARTITIONS;
DOI
10.1007/s10044-017-0676-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Clustering, a major task in data mining, is responsible for discovering hidden patterns in unlabeled datasets. Finding the best clustering is also considered one of the most challenging problems in data mining. Because of the complexity of the problem and the weaknesses of primary clustering algorithms, a large part of the research has been directed toward ensemble clustering methods. Ensemble clustering aggregates a pool of base clusterings and produces an output clustering, also called the consensus clustering. The consensus clustering is usually better than the output clusterings of the basic clustering algorithms. However, low-quality base clusterings make their consensus clustering weak. Although some research has addressed selecting a subset of high-quality base clusterings according to a clustering assessment metric, cluster-level selection has always been ignored. In this paper, a new clustering ensemble framework based on cluster-level weighting is proposed. The degree of certainty that the given ensemble has about a cluster is taken as the reliability of that cluster, and it is computed from the amount of accretion of that cluster by the ensemble. The final ensemble is then created by selecting the best clusters and assigning each selected cluster a weight based on its reliability. Next, the paper proposes a cluster-level weighted co-association matrix in place of the traditional co-association matrix. Two consensus functions are then introduced and used to produce the consensus partition. In the reported experiments, the proposed framework outperforms the state-of-the-art clustering ensemble methods.
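The idea of a cluster-level weighted co-association matrix can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function name `weighted_co_association`, the input layout, and the example weights are all assumptions made for illustration, and the weights are supplied by hand here because the abstract does not define how the accretion-based reliability is computed. Each pair of points that share a cluster in a base clustering contributes that cluster's weight instead of a fixed 1.

```python
import numpy as np

def weighted_co_association(labelings, cluster_weights):
    """Build a cluster-level weighted co-association matrix (illustrative sketch).

    labelings: list of 1-D integer label arrays, one per base clustering.
    cluster_weights: list of dicts, one per base clustering, mapping a
        cluster label to a hypothetical reliability weight in [0, 1].
        Clusters missing from a dict are treated as not selected (weight 0).
    """
    n = len(labelings[0])
    total = np.zeros((n, n))
    for labels, weights in zip(labelings, cluster_weights):
        labels = np.asarray(labels)
        for cluster_id, w in weights.items():
            # Indicator vector of the points belonging to this cluster.
            member = (labels == cluster_id).astype(float)
            # Every pair inside the cluster receives the cluster's weight.
            total += w * np.outer(member, member)
    # Average over the base clusterings.
    return total / len(labelings)

# Two toy base clusterings of four points (made-up data).
base = [np.array([0, 0, 1, 1]), np.array([0, 0, 0, 1])]
# Hand-picked weights: the large cluster in the second clustering is
# assumed to be less reliable.
weights = [{0: 1.0, 1: 1.0}, {0: 0.5, 1: 1.0}]
W = weighted_co_association(base, weights)
```

When every weight is 1, this reduces to the traditional co-association matrix (the fraction of base clusterings in which two points share a cluster). A consensus function could then be applied to `W`, for example by treating it as a similarity matrix for hierarchical clustering; the two consensus functions used in the paper are not described in the abstract.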
Pages: 133-145
Number of pages: 13