Some connectivity based cluster validity indices

Cited by: 40
Authors
Saha, Sriparna [1]
Bandyopadhyay, Sanghamitra [2]
Affiliations
[1] Indian Inst Technol, Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
Keywords
Clustering; Cluster validity index; Connectivity; Relative neighborhood graph; Single linkage clustering technique; K-means clustering technique; Performance evaluation; Stability; Symmetry; Number
DOI
10.1016/j.asoc.2011.12.013
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Identifying the correct number of clusters and an appropriate partitioning technique are important considerations in clustering, and several cluster validity indices, primarily based on the Euclidean distance, have been used in the literature for this purpose. In this paper, a new measure of connectivity is incorporated into the definitions of seven cluster validity indices, namely the DB-index, Dunn-index, Generalized Dunn-index, PS-index, I-index, XB-index and SV-index, yielding seven new cluster validity indices that can automatically detect clusters of any shape, size or convexity as long as they are well separated. Connectivity is measured using a novel approach based on the concept of the relative neighborhood graph. It is empirically established that incorporating connectivity significantly improves the ability of these indices to identify the appropriate number of clusters. Two well-known clustering techniques, single linkage and K-means, are used as the underlying partitioning algorithms. Results on eight artificially generated and three real-life data sets show that the connectivity-based Dunn-index outperforms the other six indices. Comparisons are also made with the original versions of these seven cluster validity indices. © 2011 Elsevier B.V. All rights reserved.
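The abstract does not spell out the connectivity measure, so the following is only a rough sketch of the general idea, not the authors' formulation. It assumes that the connectivity distance between two points is the smallest achievable "largest edge" over all paths joining them in the relative neighborhood graph, and then plugs that distance into a plain Dunn-index. All function names (`relative_neighborhood_graph`, `connectivity_distance`, `dunn_index`) and the toy data are hypothetical.

```python
import numpy as np

def pairwise_distances(points):
    """Euclidean distance matrix for an (n, d) array of points."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def relative_neighborhood_graph(dist):
    """Boolean adjacency matrix: i and j are linked unless some third
    point k is strictly closer to both of them than they are to each other."""
    n = dist.shape[0]
    adj = np.zeros((n, n), dtype=bool)
    for i in range(n):
        for j in range(i + 1, n):
            worst = np.maximum(dist[i], dist[j])  # max(d(i,k), d(j,k)) for every k
            worst[[i, j]] = np.inf                # ignore the endpoints themselves
            if not np.any(worst < dist[i, j]):
                adj[i, j] = adj[j, i] = True
    return adj

def connectivity_distance(dist, adj):
    """ASSUMPTION: connectivity distance = minimax path distance on the graph,
    computed with a Floyd-Warshall pass using (min, max) instead of (min, +)."""
    n = dist.shape[0]
    conn = np.where(adj, dist, np.inf)
    np.fill_diagonal(conn, 0.0)
    for k in range(n):
        via_k = np.maximum(conn[:, k][:, None], conn[k, :][None, :])
        conn = np.minimum(conn, via_k)
    return conn

def dunn_index(dist, labels):
    """Classical Dunn-index: smallest between-cluster distance divided by
    the largest within-cluster distance, for any given distance matrix."""
    labels = np.asarray(labels)
    clusters = np.unique(labels)
    max_diam = max(dist[np.ix_(labels == c, labels == c)].max() for c in clusters)
    min_sep = min(
        dist[np.ix_(labels == a, labels == b)].min()
        for idx, a in enumerate(clusters)
        for b in clusters[idx + 1:]
    )
    return min_sep / max_diam

# Toy data (illustrative only): two elongated, parallel chains of points.
points = np.array([[x, 0.0] for x in range(5)] + [[x, 3.0] for x in range(5)])
labels = np.array([0] * 5 + [1] * 5)

euclid = pairwise_distances(points)
conn = connectivity_distance(euclid, relative_neighborhood_graph(euclid))

print("Euclidean Dunn-index:   ", dunn_index(euclid, labels))
print("Connectivity Dunn-index:", dunn_index(conn, labels))
```

On this toy data the Euclidean version penalizes the long chains because their end-to-end length exceeds the gap between them, whereas the path-based version only sees the unit steps inside each chain, which is the kind of shape insensitivity the abstract describes.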
Pages: 1555-1565
Page count: 11