Automatic centroid initialization in k-means using artificial hummingbird algorithm

被引:0
|
作者
Kusum Preeti [1 ]
undefined Deep [2 ]
机构
[1] Indian Institute of Technology Roorkee,Department of Mathematics
[2] The University of Tennessee Health Science Centre,undefined
关键词
K-means; Nature inspired algorithm; Data clustering; Clustering analysis;
D O I
10.1007/s00521-024-10764-4
中图分类号
学科分类号
摘要
K-means is a widely used technique that heavily relies on the initial cluster centroid location. Poorly chosen centroids can cause the algorithm to get trapped in suboptimal solutions. Additionally, determining the optimal number of clusters for large datasets is computationally expensive. To address these challenges, a recently developed Artificial Hummingbird Algorithm (AHA) is used to initialize cluster centroid locations and automatically determine the best estimate for the number of clusters. AHA simulates the specialized flight skills and intelligent foraging strategies of hummingbirds, striking a fine balance between exploration and exploitation during the search process. Unlike other data clustering approaches that use a fixed threshold in heuristic methods, we propose a dynamic threshold based on the variance of the data with respect to its centroids for activating cluster centroids in AHA. The data are automatically partitioned into k cluster centroids such that cohesion, measured by cluster diameters, and separation, measured by nearest neighbor distance, are optimized. The algorithm is tested on various datasets, including real-world data, fundamental clustering benchmarks, synthetic data, and high-dimensional data. To evaluate performance, metrics such as fitness value, inter-cluster distance, and intra-cluster distance were used. Results indicate that the proposed method ranked first and achieved superior clustering performance compared to state-of-the-art algorithms.
引用
收藏
页码:3373 / 3398
页数:25
相关论文
共 50 条
  • [41] Hierarchical initialization approach for K-Means clustering
    Lu, J. F.
    Tang, J. B.
    Tang, Z. M.
    Yang, J. Y.
    PATTERN RECOGNITION LETTERS, 2008, 29 (06) : 787 - 795
  • [42] A k-means clustering algorithm initialization for unsupervised statistical satellite image segmentation
    Rekik, Ahmed
    Zribi, Mourad
    Benjelloun, Mohammed
    ben Hamida, Ahmed
    2006 1ST IEEE INTERNATIONAL CONFERENCE ON E-LEARNING IN INDUSTRIAL ELECTRONICS, 2006, : 11 - +
  • [43] An Improved Initialization Center K-means Clustering Algorithm Based on Distance and Density
    Duan, Yanling
    Liu, Qun
    Xia, Shuyin
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [44] EFFECTIVE INITIALIZATION OF K-MEANS FOR COLOR QUANTIZATION
    Celebi, M. Emre
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1649 - 1652
  • [45] Automatic Detection of k with Suitable Seed Values for Classic k-means Algorithm Using DE
    Bala, Chayan
    Basu, Tripti
    Dasgupta, Abhijit
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 759 - 765
  • [46] Centroid Selection in Kernel Extreme Learning Machine using K-means
    Singhal, Mona
    Shukla, Sanyam
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 708 - 711
  • [47] VQ codebook design using modified K-means algorithm with feature classification and grouping based initialization
    Lang Wang
    Zhe-Ming Lu
    Long-Hua Ma
    Ya-Pei Feng
    Multimedia Tools and Applications, 2018, 77 : 8495 - 8510
  • [48] VQ codebook design using modified K-means algorithm with feature classification and grouping based initialization
    Wang, Lang
    Lu, Zhe-Ming
    Ma, Long-Hua
    Feng, Ya-Pei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8495 - 8510
  • [49] K-means algorithm based on improved artificial bee colony algorithm
    Yu Z.-J.
    Qin H.
    Yu, Zuo-Jun (yuzj@upc.edu.cn), 2018, Northeast University (33): : 181 - 185
  • [50] Improving spherical k-means for document clustering: Fast initialization, sparse centroid projection, and efficient cluster labeling
    Kim, Hyunjoong
    Kim, Han Kyul
    Cho, Sungzoon
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150