Assessing clustering methods using Shannon's entropy

被引:1
|
作者
Hoayek, Anis [1 ]
Rulliere, Didier [1 ]
机构
[1] Univ Clermont Auvergne, Inst Henri Fayol, CNRS, Mines St Etienne,UMR 6158 LIMOS, F-42023 St Etienne, France
关键词
NUMBER; ALGORITHM;
D O I
10.1016/j.ins.2024.121510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised clustering techniques are crucial for effectively partitioning datasets into meaningful subgroups. In this paper, we introduce a novel clustering fuzziness metric based on Shannon's entropy, which quantifies the level of uncertainty in clustering assignments. This metric is employed to develop a statistical test for evaluating clustering algorithms and to propose parametric corrections to enhance their performance. We provide empirical evidence showing that our approach significantly improves clustering quality compared to well-known algorithms such as Fuzzy K-Means (FKM) and Fanny. For instance, applying our correction method led to a notable decrease in Jensen-Shannon Divergence (JSD) values, indicating enhanced clustering accuracy. Our methodology is demonstrated through comprehensive experiments on both simulated and real-world datasets. Additionally, our approach proves effective in determining the optimal number of clusters, showing superiority over traditional methods. These results highlight the practical benefits of our metric and correction techniques for improving clustering performance.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Assessing the level of digital maturity of enterprises in the Central and Eastern European countries using the MCDM and Shannon's entropy methods
    Brodny, Jaroslaw
    Tutak, Magdalena
    PLOS ONE, 2021, 16 (07):
  • [2] Rough set-based clustering with refinement using Shannon's entropy theory
    Chen, Chun-Bao
    Wang, Li-Ya
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2006, 52 (10-11) : 1563 - 1576
  • [3] Assessing RNA-Seq Workflow Methodologies Using Shannon Entropy
    Carels, Nicolas
    BIOLOGY-BASEL, 2024, 13 (07):
  • [4] Nonparametric clustering of discrete probability distributions with generalized Shannon's entropy and heatmap
    Zhang, Jialin
    Shi, Jingyi
    STATISTICS & PROBABILITY LETTERS, 2024, 208
  • [5] Bayesian Shannon Entropy for Assessing Patient's Data Interrelation in Medical Applications
    Martynenko, Alexander
    Pastor, Xavier
    9TH EUROPEAN MEDICAL AND BIOLOGICAL ENGINEERING CONFERENCE, VOL 1, EMBEC 2024, 2024, 112 : 141 - 150
  • [6] Shannon entropy: A novel tool for assessing pentagon drawing in Parkinson's disease
    Brabenec, L.
    Klobusiakova, P.
    Mekyska, J.
    Rektorova, I.
    MOVEMENT DISORDERS, 2021, 36 : S288 - S289
  • [7] Shannon's entropy of partitions determined by hierarchical clustering trees in asymmetry and dimension identification
    Corredor, J. S.
    Quiroz, A. J.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (10) : 5954 - 5966
  • [8] Applying Grey Clustering and Shannon's Entropy to Assess Sediment Quality from a Watershed
    Delgado, Alexi
    Vilchez, Betsy
    Chipana, Fabian
    Trejo, Gerson
    Acari, Renato
    Camarena, Rony
    Galicia, Victor
    Carbajal, Chiara
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (09) : 679 - 688
  • [9] Increasing the Discriminatory Power of DEA Using Shannon's Entropy
    Xie, Qiwei
    Dai, Qianzhi
    Li, Yongjun
    Jiang, An
    ENTROPY, 2014, 16 (03) : 1571 - 1585
  • [10] Configuration of measurement systems using Shannon's entropy function
    Robert-Nicoud, Y
    Raphael, B
    Smith, IFC
    COMPUTERS & STRUCTURES, 2005, 83 (8-9) : 599 - 612