From A-to-Z review of clustering validation indices

Cited by: 1
Authors
Hassan, Bryar A. [1, 2]
Tayfor, Noor Bahjat [3]
Hassan, Alla A. [4]
Ahmed, Aram M. [1]
Rashid, Tarik A. [1]
Abdalla, Naz N. [5]
Affiliations
[1] Univ Kurdistan Hewler, Sch Sci & Engn, Comp Sci & Engn Dept, Erbil, Iraq
[2] Charmo Univ, Coll Sci, Dept Comp Sci, Chamchamal 46023, Sulaimani, Iraq
[3] Kurdistan Tech Inst, Dept Informat Technol, Sulaimani, Iraq
[4] Sulaimani Polytech Univ, Comp Sci Inst, Dept Database Technol, Sulaimani, Kri, Iraq
[5] Univ Tehran, Fac Math Stat & Comp Sci, Informat Technol Dept, Tehran, Iran
Keywords
Data clustering metrics; External clustering validation; Internal clustering validation; Cluster validity indices; K-MEANS; VALIDITY INDEX; INTERNAL INDEX; ALGORITHM; NUMBER;
DOI
10.1016/j.neucom.2024.128198
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data clustering involves identifying latent similarities within a dataset and organizing its data points into clusters or groups. The outcomes of clustering algorithms differ because they are sensitive to the intrinsic characteristics of the original dataset, including noise and dimensionality. The effectiveness of such clustering procedures directly impacts the homogeneity of clusters, underscoring the significance of evaluating algorithmic outcomes. Consequently, assessing clustering quality is a significant and complex endeavor. A pivotal aspect of clustering validation is the cluster validity metric, which aids in determining the optimal number of clusters. The main goal of this study is to review and explain the mathematical operation of a selection of internal and external cluster validity indices, to categorize these indices, and to suggest directions for future clustering validation research. In addition, we review and evaluate the performance of internal and external clustering validation indices on the most common clustering algorithms, such as the evolutionary clustering algorithm star (ECA*). Finally, we suggest a classification framework for examining the functionality of both internal and external clustering validation measures with regard to their ideal values, user-friendliness, responsiveness to input data, and appropriateness across various fields. This classification aids researchers in selecting the clustering validation measure that suits their specific requirements.
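The distinction between external and internal validation can be illustrated with a minimal sketch. The abstract does not name specific indices, so the two used here are assumptions chosen for illustration: the Rand index as an external measure (it compares a clustering against known ground-truth labels) and the mean silhouette width as an internal measure (it uses only the data and the cluster assignment). The toy points and the function names are hypothetical, and the silhouette function assumes at least two clusters.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def rand_index(labels_true, labels_pred):
    """External index: fraction of point pairs on which two labelings agree."""
    pairs = list(combinations(range(len(labels_true)), 2))
    agree = sum(
        (labels_true[i] == labels_true[j]) == (labels_pred[i] == labels_pred[j])
        for i, j in pairs
    )
    return agree / len(pairs)


def silhouette(points, labels):
    """Internal index: mean silhouette width (assumes two or more clusters)."""
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)

    scores = []
    for point, label in zip(points, labels):
        own = clusters[label]
        if len(own) == 1:
            scores.append(0.0)  # common convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        a = sum(dist(point, q) for q in own) / (len(own) - 1)
        # b: mean distance to the members of the nearest other cluster
        b = min(
            sum(dist(point, q) for q in other) / len(other)
            for key, other in clusters.items()
            if key != label
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: two well-separated groups of two points each.
points = [(0, 0), (0, 1), (10, 0), (10, 1)]
truth = [0, 0, 1, 1]
good = [1, 1, 0, 0]  # same partition as truth, different label names
bad = [0, 1, 0, 1]   # splits each true group in half

print(rand_index(truth, good), rand_index(truth, bad))
print(silhouette(points, good), silhouette(points, bad))
```

In this sketch both indices prefer the `good` partition: the Rand index ignores how clusters are named and only checks pairwise agreement with the reference labels, while the silhouette reaches the same ranking without any reference labels at all. Internal indices of this kind are the ones typically compared across candidate cluster counts when the number of clusters is unknown.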
Pages: 18