A cluster validity evaluation method for dynamically determining the near-optimal number of clusters

被引:0
|
作者
Xiangjun Li
Wei Liang
Xinping Zhang
Song Qing
Pei-Chann Chang
机构
[1] Nanchang University,School of Software
[2] Nanchang University,Department of Computer Science and Technology
[3] Jiangxi Vocational College of Industry and Engineering,undefined
[4] The First Affiliated Hospital of Nanchang University,undefined
来源
Soft Computing | 2020年 / 24卷
关键词
Cluster validity index; Euclidean distance; Dynamic clustering; Near-optimal number of clusters; Cluster validity evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
Cluster validity evaluation is a hot issue in clustering algorithm research. Aiming at determining the optimal number of clusters in cluster validity evaluation, this paper proposes a new cluster validity index Ratio of Deviation of Sum-of-squares and Euclid distance (RDSED), and designs a cluster validity evaluation method based on RDSED which is suitable to dynamically determine the near-optimal number of clusters. Firstly, based on the analysis of the relationships of the intra-class and inter-class, the concepts of sum-of-squares of within-cluster, sum-of-squares of between-cluster, total sum-of-squares, sum of intra-cluster distance and average distance between clusters are proposed, and then a cluster validity index RDSED based on these concepts is constructed. Secondly, a cluster validity evaluation method based on RDSED for dynamically determining the near-optimal number of clusters is designed. In this method, RDSED value is calculated from large to small in the range of clustering number and this index value is used to dynamically terminate the clustering validity verification process, and finally the near-optimal number of clusters and clustering partition results are obtained. Experiment results of artificial datasets and real datasets show that, compared with some classical clustering validity evaluation method, the proposed cluster validity evaluation method can obtain the near-optimal number of clusters that is closest to the real cluster number in most cases and can effectively evaluate clustering partition results.
引用
收藏
页码:9227 / 9241
页数:14
相关论文
共 50 条
  • [21] Performance evaluation of main approaches for determining optimal number of clusters in wireless sensor networks
    Benmahdi, Meryem Bochra
    Lehsaini, Mohamed
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2020, 33 (03) : 184 - 195
  • [22] A new evolutionary algorithm for determining the optimal number of clusters
    Lu, Wei
    Traore, Issa
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 1, PROCEEDINGS, 2006, : 648 - +
  • [23] Near-optimal dispatching policy for energy-aware server clusters
    Aalto, Samuli
    Lassila, Pasi
    PERFORMANCE EVALUATION, 2019, 135
  • [24] A near-optimal similarity join algorithm and performance evaluation
    Yang, ZW
    Yang, GQ
    INFORMATION SCIENCES, 2004, 167 (1-4) : 87 - 108
  • [25] A Method for Automatically Determining The Number of Clusters of LAC
    Liu, Han
    Wu, Qingfeng
    Dong, Huailin
    Wang, Shuangshuang
    Cai, Qing
    Ma, Zhuo
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1907 - +
  • [26] Near-optimal method for highly smooth convex optimization
    Bubeck, Sebastien
    Jiang, Qijia
    Lee, Yin Tat
    Li, Yuanzhi
    Sidford, Aaron
    CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [27] Near-optimal spatial encoding for dynamically adaptive MRI: Mathematical principles and computational methods
    Zientara, GP
    Panych, LP
    Jolesz, FA
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1999, 10 (02) : 151 - 165
  • [28] Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability
    Sambaturu, Prathyush
    Gupta, Aparna
    Davidson, Ian
    Ravi, S. S.
    Vullikanti, Anil
    Warren, Andrew
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1636 - 1643
  • [29] A bi-objective optimization method to produce a near-optimal number of classifiers and increase diversity in Bagging
    Asadi, Shahrokh
    Roshan, Seyed Ehsan
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [30] Estimating the Optimal Number of Clusters Via Internal Validity Index
    Zhou, Shibing
    Liu, Fei
    Song, Wei
    NEURAL PROCESSING LETTERS, 2021, 53 (02) : 1013 - 1034