A cluster validity evaluation method for dynamically determining the near-optimal number of clusters

被引:0
|
作者
Xiangjun Li
Wei Liang
Xinping Zhang
Song Qing
Pei-Chann Chang
机构
[1] Nanchang University,School of Software
[2] Nanchang University,Department of Computer Science and Technology
[3] Jiangxi Vocational College of Industry and Engineering,undefined
[4] The First Affiliated Hospital of Nanchang University,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Cluster validity index; Euclidean distance; Dynamic clustering; Near-optimal number of clusters; Cluster validity evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
Cluster validity evaluation is a hot issue in clustering algorithm research. Aiming at determining the optimal number of clusters in cluster validity evaluation, this paper proposes a new cluster validity index Ratio of Deviation of Sum-of-squares and Euclid distance (RDSED), and designs a cluster validity evaluation method based on RDSED which is suitable to dynamically determine the near-optimal number of clusters. Firstly, based on the analysis of the relationships of the intra-class and inter-class, the concepts of sum-of-squares of within-cluster, sum-of-squares of between-cluster, total sum-of-squares, sum of intra-cluster distance and average distance between clusters are proposed, and then a cluster validity index RDSED based on these concepts is constructed. Secondly, a cluster validity evaluation method based on RDSED for dynamically determining the near-optimal number of clusters is designed. In this method, RDSED value is calculated from large to small in the range of clustering number and this index value is used to dynamically terminate the clustering validity verification process, and finally the near-optimal number of clusters and clustering partition results are obtained. Experiment results of artificial datasets and real datasets show that, compared with some classical clustering validity evaluation method, the proposed cluster validity evaluation method can obtain the near-optimal number of clusters that is closest to the real cluster number in most cases and can effectively evaluate clustering partition results.
引用
收藏
页码:9227 / 9241
页数:14
相关论文
共 50 条
  • [1] A cluster validity evaluation method for dynamically determining the near-optimal number of clusters
    Li, Xiangjun
    Liang, Wei
    Zhang, Xinping
    Qing, Song
    Chang, Pei-Chann
    SOFT COMPUTING, 2020, 24 (12) : 9227 - 9241
  • [2] A method of dynamically determining the number of clusters and cluster centers
    Shao Xiongkai
    Pi Ling
    Liu Lianzhou
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 283 - 286
  • [3] DETERMINING THE OPTIMAL NUMBER OF CLUSTERS IN CLUSTER ANALYSIS
    Loster, Tomas
    10TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2016, : 1078 - 1090
  • [4] EVALUATION OF COEFFICIENTS FOR DETERMINING THE OPTIMAL NUMBER OF CLUSTERS IN CLUSTER ANALYSIS ON REAL DATA SETS
    Loster, Tomas
    9TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2015, : 1014 - 1023
  • [5] On cluster validity index for estimation of the optimal number of fuzzy clusters
    Kim, DW
    Lee, KH
    Lee, DH
    PATTERN RECOGNITION, 2004, 37 (10) : 2009 - 2025
  • [6] Cluster Validation Method for Determining the Number of Clusters in Categorical Sequences
    Guo, Gongde
    Chen, Lifei
    Ye, Yanfang
    Jiang, Qingshan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 2936 - 2948
  • [7] Enhanced Cluster Validity Index for the Evaluation of Optimal Number of Clusters for Fuzzy C-Means Algorithm
    Bharill, Neha
    Tiwari, Aruna
    2014 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2014, : 1526 - 1533
  • [8] Determining the number of clusters in cluster analysis
    My-Young Cheong
    Hakbae Lee
    Journal of the Korean Statistical Society, 2008, 37 : 135 - 143
  • [9] Determining the number of clusters in cluster analysis
    Cheong, My-Young
    Lee, Hakbae
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2008, 37 (02) : 135 - 143
  • [10] Method for Determining the Optimal Number of Clusters Based on Agglomerative Hierarchical Clustering
    Zhou, Shibing
    Xu, Zhenyuan
    Liu, Fei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3007 - 3017