A Method to Determine the Number of Clusters Based on Multi-validity Index

被引:1
|
作者
Sun, Ning [1 ]
Yu, Hong [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
来源
ROUGH SETS, IJCRS 2018 | 2018年 / 11103卷
基金
中国国家自然科学基金;
关键词
Clustering; Uncertainty; Three-way decisions; Number of clusters; Multi-validity index;
D O I
10.1007/978-3-319-99368-3_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cluster analysis is a method of unsupervised learning technology which is playing a more and more important role in data mining. However, one basic and difficult question for clustering is how to gain the number of clusters automatically. The traditional solution for the problem is to introduce a single validity index which may lead to failure because the index is bias to some specific condition. On the other hand, most of the existing clustering algorithms are based on hard partitioning which can not reflect the uncertainty of the data in the clustering process. To combat these drawbacks, this paper proposes a method to determine the number of clusters automatically based on three-way decision and multi-validity index which includes three parts: (1) the k-means clustering algorithm is devised to obtain the three-way clustering results; (2) multi-validity indexes are employed to evaluate the results and each evaluated result is weighed according to the mean similarity between the corresponding clustering result and the others based on the idea of the median partition in clustering ensemble; and (3) the comprehensive evaluation results are sorted and the best ranked k value is selected as the optional number of clusters. The experimental results show that the proposed method is better than the single evaluation method used in the fusion at determining the number of clusters automatically.
引用
收藏
页码:427 / 439
页数:13
相关论文
共 50 条
  • [1] Validity index and number of clusters
    Saad, Mohamed Fadhel
    Alimi, Adel M.
    [J]. International Journal of Computer Science Issues, 2012, 9 (1 1-3): : 52 - 57
  • [2] A medoid-based deviation ratio index to determine the number of clusters in a dataset
    Kariyam, Adhitya Ronnie
    Abdurakhman
    Effendie, Adhitya Ronnie
    [J]. METHODSX, 2023, 10
  • [3] A novel validity index for determination of the optimal number of clusters
    Kim, DJ
    Park, YW
    Park, DJ
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (02) : 281 - 285
  • [4] On cluster validity index for estimation of the optimal number of fuzzy clusters
    Kim, DW
    Lee, KH
    Lee, DH
    [J]. PATTERN RECOGNITION, 2004, 37 (10) : 2009 - 2025
  • [5] Estimating the Optimal Number of Clusters Via Internal Validity Index
    Zhou, Shibing
    Liu, Fei
    Song, Wei
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (02) : 1013 - 1034
  • [6] Estimating the Optimal Number of Clusters Via Internal Validity Index
    Shibing Zhou
    Fei Liu
    Wei Song
    [J]. Neural Processing Letters, 2021, 53 : 1013 - 1034
  • [7] An Improved Clustering Validity Index for Determining the Number of Malware Clusters
    Wang, Youyu
    Ye, Yanfang
    Chen, Haishan
    Jiang, Qingshan
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION IN COMMUNICATION, 2009, : 544 - +
  • [8] REBET: a method to determine the number of cell clusters based on batch effect removal
    Fang, Zhao-Yu
    Lin, Cui-Xiang
    Xu, Yun-Pei
    Li, Hong-Dong
    Xu, Qing-Song
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [9] A new validity clustering index-based on finding new centroid positions using the mean of clustered data to determine the optimum number of clusters
    Abdalameer, Ahmed Khaldoon
    Alswaitti, Mohammed
    Alsudani, Ahmed Adnan
    Isa, Nor Ashidi Mat
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [10] An Adaptive Method to Determine the Number of Clusters in Clustering Process
    Huan Doan
    Dinh Thuan Nguyen
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS), 2014,