Automatic centroid initialization in k-means using artificial hummingbird algorithm

被引:0
|
作者
Kusum Preeti [1 ]
undefined Deep [2 ]
机构
[1] Indian Institute of Technology Roorkee,Department of Mathematics
[2] The University of Tennessee Health Science Centre,undefined
关键词
K-means; Nature inspired algorithm; Data clustering; Clustering analysis;
D O I
10.1007/s00521-024-10764-4
中图分类号
学科分类号
摘要
K-means is a widely used technique that heavily relies on the initial cluster centroid location. Poorly chosen centroids can cause the algorithm to get trapped in suboptimal solutions. Additionally, determining the optimal number of clusters for large datasets is computationally expensive. To address these challenges, a recently developed Artificial Hummingbird Algorithm (AHA) is used to initialize cluster centroid locations and automatically determine the best estimate for the number of clusters. AHA simulates the specialized flight skills and intelligent foraging strategies of hummingbirds, striking a fine balance between exploration and exploitation during the search process. Unlike other data clustering approaches that use a fixed threshold in heuristic methods, we propose a dynamic threshold based on the variance of the data with respect to its centroids for activating cluster centroids in AHA. The data are automatically partitioned into k cluster centroids such that cohesion, measured by cluster diameters, and separation, measured by nearest neighbor distance, are optimized. The algorithm is tested on various datasets, including real-world data, fundamental clustering benchmarks, synthetic data, and high-dimensional data. To evaluate performance, metrics such as fitness value, inter-cluster distance, and intra-cluster distance were used. Results indicate that the proposed method ranked first and achieved superior clustering performance compared to state-of-the-art algorithms.
引用
收藏
页码:3373 / 3398
页数:25
相关论文
共 50 条
  • [21] An Improved Heuristic K-Means Clustering Method Using Genetic Algorithm Based Initialization
    Mustafi, D.
    Sahoo, G.
    Mustafi, A.
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, 2017, 509 : 123 - 132
  • [22] An Initialization Scheme for Supervized K-means
    Lemaire, Vincent
    Ismaili, Oumaima Alaoui
    Cornuejols, Antoine
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [23] Spectral method of K-means initialization
    Qian, Xian
    Huang, Xuan-Jing
    Wu, Li-De
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (04): : 342 - 346
  • [24] Importance of Initialization in K-Means Clustering
    Gupta, Anubhav
    Tomer, Antriksh
    Dahiya, Sonika
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [25] The New K-Means Initialization Method
    Brejna, Bartosz
    Pietranik, Marcin
    Kozierkiewicz, Adrianna
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, ICCCI 2024, 2024, 14810 : 372 - 381
  • [26] Automatic Lung Segmentation By Using Histogram Based K-means Algorithm
    Dincer, Esra
    Duru, Nevcihan
    2016 ELECTRIC ELECTRONICS, COMPUTER SCIENCE, BIOMEDICAL ENGINEERINGS' MEETING (EBBT), 2016,
  • [27] Initial Centroid Selection Method for an Enhanced K-means Clustering Algorithm
    Aamer, Youssef
    Benkaouz, Yahya
    Ouzzif, Mohammed
    Bouragba, Khalid
    UBIQUITOUS NETWORKING, UNET 2019, 2020, 12293 : 182 - 190
  • [28] AHA-3WKM: The optimization of K-means with three-way clustering and artificial hummingbird algorithm
    Chen, Xiying
    Liu, Caihui
    Lin, Bowen
    Lai, Jianying
    Miao, Duoqian
    INFORMATION SCIENCES, 2024, 672
  • [29] A comparative study of efficient initialization methods for the k-means clustering algorithm
    Celebi, M. Emre
    Kingravi, Hassan A.
    Vela, Patricio A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (01) : 200 - 210
  • [30] An Optimized Initialization Center K-means Clustering Algorithm based on Density
    Yuan, Qilong
    Shi, Haibo
    Zhou, Xiaofeng
    2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 790 - 794