Automatic centroid initialization in k-means using artificial hummingbird algorithm

被引:0
|
作者
Kusum Preeti [1 ]
undefined Deep [2 ]
机构
[1] Indian Institute of Technology Roorkee,Department of Mathematics
[2] The University of Tennessee Health Science Centre,undefined
关键词
K-means; Nature inspired algorithm; Data clustering; Clustering analysis;
D O I
10.1007/s00521-024-10764-4
中图分类号
学科分类号
摘要
K-means is a widely used technique that heavily relies on the initial cluster centroid location. Poorly chosen centroids can cause the algorithm to get trapped in suboptimal solutions. Additionally, determining the optimal number of clusters for large datasets is computationally expensive. To address these challenges, a recently developed Artificial Hummingbird Algorithm (AHA) is used to initialize cluster centroid locations and automatically determine the best estimate for the number of clusters. AHA simulates the specialized flight skills and intelligent foraging strategies of hummingbirds, striking a fine balance between exploration and exploitation during the search process. Unlike other data clustering approaches that use a fixed threshold in heuristic methods, we propose a dynamic threshold based on the variance of the data with respect to its centroids for activating cluster centroids in AHA. The data are automatically partitioned into k cluster centroids such that cohesion, measured by cluster diameters, and separation, measured by nearest neighbor distance, are optimized. The algorithm is tested on various datasets, including real-world data, fundamental clustering benchmarks, synthetic data, and high-dimensional data. To evaluate performance, metrics such as fitness value, inter-cluster distance, and intra-cluster distance were used. Results indicate that the proposed method ranked first and achieved superior clustering performance compared to state-of-the-art algorithms.
引用
收藏
页码:3373 / 3398
页数:25
相关论文
共 50 条
  • [1] Greedy centroid initialization for federated K-means
    Yang, Kun
    Amiri, Mohammad Mohammadi
    Kulkarni, Sanjeev R.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (06) : 3393 - 3425
  • [2] Density K-means : A New Algorithm for Centers Initialization for K-means
    Lan, Xv
    Li, Qian
    Zheng, Yi
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 958 - 961
  • [3] DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING
    Celebi, M. Emre
    Kingravi, Hassan A.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [4] AN EFFICIENT K-MEANS CLUSTERING INITIALIZATION USING OPTIMIZATION ALGORITHM
    Divya, V.
    Deepika, R.
    Yamini, C.
    Sobiyaa, P.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [5] An initialization method for the K-Means algorithm using neighborhood model
    Cao, Fuyuan
    Liang, Jiye
    Jiang, Guang
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 58 (03) : 474 - 483
  • [6] Adaptive Initialization Method for K-Means Algorithm
    Yang, Jie
    Wang, Yu-Kai
    Yao, Xin
    Lin, Chin-Teng
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [7] A robust algorithm for cluster initialization using uniform effect of k-Means
    Peng, Liuqing
    Zhang, Junying
    Xu, Jin
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2010, 38 (08): : 73 - 76
  • [8] AN INTELLIGENT INITIALIZATION METHOD FOR THE K-MEANS CLUSTERING ALGORITHM
    Sheu, Jyh-Jian
    Chen, Wei-Ming
    Tsai, Wen-Bin
    Chu, Ko-Tsung
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2551 - 2566
  • [9] HOW THE INITIALIZATION AFFECTS THE STABILITY OF THE k-MEANS ALGORITHM
    Bubeck, Sebastien
    Meila, Marina
    von Luxburg, Ulrike
    ESAIM-PROBABILITY AND STATISTICS, 2012, 16 : 436 - 452
  • [10] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302