An ensemble agglomerative hierarchical clustering algorithm based on clusters clustering technique and the novel similarity measurement

被引:57
|
作者
Li, Teng [1 ]
Rezaeipanah, Amin [2 ]
El Din, ElSayed M. Tag [3 ]
机构
[1] Chongqing Coll Elect Engn, Artificial Intelligence & Big Data Coll, Chongqing 401331, Peoples R China
[2] Persian Gulf Univ, Dept Comp Engn, Bushehr, Iran
[3] Future Univ Egypt, Fac Engn & Technol, Elect Engn Dept, New Cairo 11845, Egypt
关键词
Hierarchical clustering; Meta-clusters; Ensemble clustering; Model selection; Similarity measurement; Clusters clustering; WEIGHTED ENSEMBLE; DENSITY;
D O I
10.1016/j.jksuci.2022.04.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The advent of architectures such as the Internet of Things (IoT) has led to the dramatic growth of data and the production of big data. Managing this often-unlabeled data is a big challenge for the real world. Hierarchical Clustering (HC) is recognized as an efficient unsupervised approach to unlabeled data analysis. In data mining, HC is a mechanism for grouping data at different scales by creating a dendrogram. One of the most common HC methods is Agglomerative Hierarchical Clustering (AHC) in which clusters are created bottom-up. In addition, ensemble clustering approaches are used today in complex problems due to the weakness of individual clustering methods. Accordingly, we propose a clustering framework using AHC methods based on ensemble approaches, which includes the clusters clustering technique and a novel similarity measurement. The proposed algorithm is a Meta-Clustering Ensemble scheme based on Model Selection (MCEMS). MCEMS uses the bi-weighting policy to solve the model selection associated problem to improve ensemble clustering. Specifically, multiple AHC individual methods cluster the data from different aspects to form the primary clusters. According to the results of different methods, the similarity between the instances is calculated using a novel similarity measurement. The MCEMS scheme involves the creation of meta-clusters by re-clustering of primary clusters. After clusters clustering, the number of optimal clusters is determined by merging similar clusters and considering a threshold. Finally, the similarity of the instances to the meta-clusters is calculated and each instance is assigned to the meta-cluster with the highest similarity to form the final clusters. Simulations have been performed on some datasets from the UCI repository to evaluate MCEMS scheme compared to state-of-the-art algorithms. Extensive experiments clearly prove the superiority of MCEMS over HMM, DSPA and WHAC algorithms based on Wilcoxon test and Cophenetic correlation coefficient. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:3828 / 3842
页数:15
相关论文
共 50 条
  • [31] Cavitation Diagnosis Method for Centrifugal Pumps based on Agglomerative Hierarchical Clustering Algorithm
    Huang H.M.
    Liu Y.
    Wu D.H.
    Wu Y.Z.
    Wu T.X.
    International Journal of Fluid Machinery and Systems, 2023, 16 (01) : 89 - 97
  • [32] Research on Optimal Design of Civil Sensors Based on Agglomerative Hierarchical Clustering Algorithm
    Cheng, Xingyan
    Zhu, Linyan
    Cheng, Yimei
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2024, 31 (05): : 1455 - 1463
  • [33] Finding the Clusters with Potential Value in Financial Time Series based on Agglomerative Hierarchical Clustering
    You, Shi Yang
    Wang, Yu Dan
    Luo, Lin Kai
    Peng, Hong
    2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE), 2016, : 77 - 81
  • [34] Intelligent Logistics Supplier Selection Based On Improved Agglomerative Hierarchical Clustering Algorithm
    Zhang, Yajie
    Lv, Yaqiong
    Tu, Lei
    Hou, Yueqiu
    2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1309 - 1314
  • [35] A new agglomerative hierarchical clustering algorithm implementation based on the map reduce framework
    Gao H.
    Jiang J.
    She L.
    Fu Y.
    International Journal of Digital Content Technology and its Applications, 2010, 4 (03) : 95 - 100
  • [36] The application of agglomerative hierarchical spatial clustering algorithm in tea blending
    Tie, Jun
    Chen, Wenying
    Sun, Chong
    Mao, Tengyue
    Xing, Guanglin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S6059 - S6068
  • [37] The application of agglomerative hierarchical spatial clustering algorithm in tea blending
    Jun Tie
    Wenying Chen
    Chong Sun
    Tengyue Mao
    Guanglin Xing
    Cluster Computing, 2019, 22 : 6059 - 6068
  • [38] A new agglomerative 2-3 Hierarchical Clustering algorithm
    Chelcea, S
    Bertrand, P
    Trousse, B
    INNOVATIONS IN CLASSIFICATION, DATA SCIENCE, AND INFORMATION SYSTEMS, 2005, : 3 - 10
  • [39] Hierarchical Agglomerative Clustering Algorithm method for distributed generation planning
    Vinothkumar, K.
    Selvan, M. P.
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2014, 56 : 259 - 269
  • [40] Model Order Reduction Based on Agglomerative Hierarchical Clustering
    Al-Dabooni, Seaar
    Wunsch, Donald
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (06) : 1881 - 1895