A GRAPH-THEORETIC CRITERION FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET

Cited by: 9
Authors
KROLAK-SCHWERDT, S [1]
ECKES, T [1]
Affiliations
[1] UNIV GESAMTHSCH WUPPERTAL, W-5600 WUPPERTAL 1, GERMANY
DOI
10.1207/s15327906mbr2704_3
Chinese Library Classification number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering model. Based on the graph-theoretic concepts of minimal spanning tree, maximal spanning tree, and homomorphic function, a new criterion is advanced that yields a well-defined clustering solution. Its performance in determining the number of clusters in several empirical data sets is evaluated by comparing it to four prominent stopping rules. It is shown that the proposed criterion not only possesses mathematically attractive properties but also may contribute to solving the number-of-clusters problem.
Pages: 541-565
Number of pages: 25