Assessing clustering methods using Shannon's entropy

被引:1
|
作者
Hoayek, Anis [1 ]
Rulliere, Didier [1 ]
机构
[1] Univ Clermont Auvergne, Inst Henri Fayol, CNRS, Mines St Etienne,UMR 6158 LIMOS, F-42023 St Etienne, France
关键词
NUMBER; ALGORITHM;
D O I
10.1016/j.ins.2024.121510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised clustering techniques are crucial for effectively partitioning datasets into meaningful subgroups. In this paper, we introduce a novel clustering fuzziness metric based on Shannon's entropy, which quantifies the level of uncertainty in clustering assignments. This metric is employed to develop a statistical test for evaluating clustering algorithms and to propose parametric corrections to enhance their performance. We provide empirical evidence showing that our approach significantly improves clustering quality compared to well-known algorithms such as Fuzzy K-Means (FKM) and Fanny. For instance, applying our correction method led to a notable decrease in Jensen-Shannon Divergence (JSD) values, indicating enhanced clustering accuracy. Our methodology is demonstrated through comprehensive experiments on both simulated and real-world datasets. Additionally, our approach proves effective in determining the optimal number of clusters, showing superiority over traditional methods. These results highlight the practical benefits of our metric and correction techniques for improving clustering performance.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Confidence intervals for Gini's diversity measure and Shannon's entropy using adjusted proportions
    From, SG
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2003, 32 (05) : 935 - 954
  • [42] Shearer's inequality and infimum rule for Shannon entropy and topological entropy
    Downarowicz, Tomasz
    Frej, Bartosz
    Romagnoli, Pierre-Paul
    DYNAMICS AND NUMBERS, 2016, 669 : 63 - 75
  • [43] The compilation of Shannon entropy measurement equation for nonlinear dynamic systems by using the interval analysis methods
    Machekhin, Yu. P.
    Kurskoy, Yu. S.
    DEVICES AND METHODS OF MEASUREMENTS, 2015, 6 (02): : 257 - 263
  • [44] An Intrinsic Characterization of Shannon's and Rényi's Entropy
    Schlather, Martin
    Ditscheid, Carmen
    ENTROPY, 2024, 26 (12)
  • [45] Technique of Gene Expression Profiles Selection Based on SOTA Clustering Algorithm Using Statistical Criteria and Shannon Entropy
    Babichev, Sergii
    Khamula, Orest
    Durnyak, Bohdan
    Skvor, Jiri
    LECTURE NOTES IN COMPUTATIONAL INTELLIGENCE AND DECISION MAKING (ISDMCI 2020), 2020, 1246 : 23 - 38
  • [46] An Active Learning Algorithm Based on Shannon Entropy for Constraint-Based Clustering
    Chen, Duo Wen
    Jin, Ying Hua
    IEEE ACCESS, 2020, 8 : 171447 - 171456
  • [47] Estimating Shannon's entropy with incomplete species inventories
    Shen, Tsung-Jen
    JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE, 2025,
  • [48] Study of physicochemical parameters and wetland water quality assessment by using Shannon's entropy
    Singh, Kunwar Raghvendra
    Dutta, Rahul
    Kalamdhad, Ajay S.
    Kumar, Bimlesh
    APPLIED WATER SCIENCE, 2022, 12 (11)
  • [49] Study of physicochemical parameters and wetland water quality assessment by using Shannon’s entropy
    Kunwar Raghvendra Singh
    Rahul Dutta
    Ajay S. Kalamdhad
    Bimlesh Kumar
    Applied Water Science, 2022, 12
  • [50] New refinements of Shannon's entropy upper bounds
    Sayyari, Yamin
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2021, 42 (08): : 1869 - 1883