A cluster validity evaluation method for dynamically determining the near-optimal number of clusters

被引:0
|
作者
Xiangjun Li
Wei Liang
Xinping Zhang
Song Qing
Pei-Chann Chang
机构
[1] Nanchang University,School of Software
[2] Nanchang University,Department of Computer Science and Technology
[3] Jiangxi Vocational College of Industry and Engineering,undefined
[4] The First Affiliated Hospital of Nanchang University,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Cluster validity index; Euclidean distance; Dynamic clustering; Near-optimal number of clusters; Cluster validity evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
Cluster validity evaluation is a hot issue in clustering algorithm research. Aiming at determining the optimal number of clusters in cluster validity evaluation, this paper proposes a new cluster validity index Ratio of Deviation of Sum-of-squares and Euclid distance (RDSED), and designs a cluster validity evaluation method based on RDSED which is suitable to dynamically determine the near-optimal number of clusters. Firstly, based on the analysis of the relationships of the intra-class and inter-class, the concepts of sum-of-squares of within-cluster, sum-of-squares of between-cluster, total sum-of-squares, sum of intra-cluster distance and average distance between clusters are proposed, and then a cluster validity index RDSED based on these concepts is constructed. Secondly, a cluster validity evaluation method based on RDSED for dynamically determining the near-optimal number of clusters is designed. In this method, RDSED value is calculated from large to small in the range of clustering number and this index value is used to dynamically terminate the clustering validity verification process, and finally the near-optimal number of clusters and clustering partition results are obtained. Experiment results of artificial datasets and real datasets show that, compared with some classical clustering validity evaluation method, the proposed cluster validity evaluation method can obtain the near-optimal number of clusters that is closest to the real cluster number in most cases and can effectively evaluate clustering partition results.
引用
收藏
页码:9227 / 9241
页数:14
相关论文
共 50 条
  • [41] DESIGN METHOD FOR NEAR-OPTIMAL NOR LOGIC CIRCUITS.
    Pessen, D.
    Fluidics Quarterly, 1977, 9 (02): : 1 - 22
  • [42] Investigating cluster validation metrics for optimal number of clusters determination
    Karanikola, Aikaterini
    Liapis, Charalampos M.
    Kotsiantis, Sotiris
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04): : 809 - 824
  • [43] An optimal hierarchically clustering number determining method
    Zhou, Hongfang
    Zhao, Xuehan
    Li, Hongyan
    Wang, Peng
    Qin, Zhentao
    Journal of Computational Information Systems, 2012, 8 (09): : 3791 - 3798
  • [44] REPLICATION AS A RULE FOR DETERMINING THE NUMBER OF CLUSTERS IN HIERARCHICAL CLUSTER-ANALYSIS
    OVERALL, JE
    MAGEE, KN
    APPLIED PSYCHOLOGICAL MEASUREMENT, 1992, 16 (02) : 119 - 128
  • [45] An Approach for Determining the Number of Clusters in a Model-Based Cluster Analysis
    Akogul, Serkan
    Erisoglu, Murat
    ENTROPY, 2017, 19 (09):
  • [46] Curvature-based method for determining the number of clusters
    Zhang, Yaqian
    Mandziuk, Jacek
    Quek, Chai Hiok
    Goh, Boon Wooi
    INFORMATION SCIENCES, 2017, 415 : 414 - 428
  • [47] A Comparative Study of Determining the Number of Clusters with a Method Proposed
    Chae, Seong San
    Lim, Nam Kyoo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2005, 18 (02) : 329 - 341
  • [48] Optimistic Planning with a Limited Number of Action Switches for Near-Optimal Nonlinear Control
    Mathe, Koppany
    Busoniu, Lucian
    Munos, Remi
    De Schutter, Bart
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 3518 - 3523
  • [49] Optimistic planning with an adaptive number of action switches for near-optimal nonlinear control
    Mathe, Koppany
    Busoniu, Lucian
    Munos, Remi
    De Schutter, Bart
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 67 : 355 - 367
  • [50] THE APPLICATION OF PSI-TRANSFORM FOR DETERMINING A NEAR-OPTIMAL PATH IN THE PRESENCE OF POLYHEDRAL OBSTACLES
    SURLA, D
    RACKOVIC, M
    COMPUTING, 1992, 48 (02) : 203 - 212