Some connectivity based cluster validity indices

被引:40
|
作者
Saha, Sriparna [1 ]
Bandyopadhyay, Sanghamitra [2 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
关键词
Clustering; Cluster validity index; Connectivity; Relative neighborhood graph; Single linkage clustering technique; K-means clustering technique; RELATIVE NEIGHBORHOOD GRAPH; PERFORMANCE EVALUATION; STABILITY; SYMMETRY; NUMBER;
D O I
10.1016/j.asoc.2011.12.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of the correct number of clusters and the appropriate partitioning technique are some important considerations in clustering where several cluster validity indices, primarily utilizing the Euclidean distance, have been used in the literature. In this paper a new measure of connectivity is incorporated in the definitions of seven cluster validity indices namely, DB-index, Dunn-index, Generalized Dunn-index, PS-index, I-index, XB-index and SV-index, thereby yielding seven new cluster validity indices which are able to automatically detect clusters of any shape, size or convexity as long as they are well-separated. Here connectivity is measured using a novel approach following the concept of relative neighborhood graph. It is empirically established that incorporation of the property of connectivity significantly improves the capabilities of these indices in identifying the appropriate number of clusters. The well-known clustering techniques, single linkage clustering technique and K-means clustering technique are used as the underlying partitioning algorithms. Results on eight artificially generated and three real-life data sets show that connectivity based Dunn-index performs the best as compared to all the other six indices. Comparisons are made with the original versions of these seven cluster validity indices. (C) 2011 Elsevier B. V. All rights reserved.
引用
收藏
页码:1555 / 1565
页数:11
相关论文
共 50 条
  • [1] Development of Some Line Symmetry Based Cluster Validity Indices
    Acharya, Sudipta
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    [J]. 2014 INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE ISCMI 2014, 2014, : 24 - 27
  • [2] On some symmetry based validity indices
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    [J]. 2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 697 - 704
  • [3] A new clustering algorithm based on cluster validity indices
    Kim, M
    Ramakrishna, RS
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 322 - 329
  • [4] Biological cluster validity indices based on the Gene Ontology
    Speer, N
    Spieth, C
    Zell, A
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VI, PROCEEDINGS, 2005, 3646 : 429 - 439
  • [5] On fuzzy cluster validity indices
    Wang, Weina
    Zhang, Yunjie
    [J]. FUZZY SETS AND SYSTEMS, 2007, 158 (19) : 2095 - 2117
  • [6] Validity of Internal Cluster Indices
    Vij, Ankit
    Khandnor, Padmavati
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMPUTATION SYSTEM AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTIONS (CSITSS), 2016, : 388 - 395
  • [7] AutoClust: A Framework for Automated Clustering based on Cluster Validity Indices
    Poulakis, Yannis
    Doulkeridis, Christos
    Kyriazis, Dimosthenis
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1220 - 1225
  • [8] Use of line based symmetry for developing cluster validity indices
    Acharya, Sudipta
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    [J]. SOFT COMPUTING, 2016, 20 (09) : 3461 - 3474
  • [9] Use of line based symmetry for developing cluster validity indices
    Sudipta Acharya
    Sriparna Saha
    Sanghamitra Bandyopadhyay
    [J]. Soft Computing, 2016, 20 : 3461 - 3474
  • [10] Integrating cluster validity indices based on data envelopment analysis
    Kim, Boseop
    Lee, Hakyeon
    Kang, Pilsung
    [J]. APPLIED SOFT COMPUTING, 2018, 64 : 94 - 108