A GRAPH-THEORETIC CRITERION FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET

Cited by: 9
Authors
KROLAKSCHWERDT, S [1]
ECKES, T [1]
Affiliation
[1] UNIV GESAMTHSCH WUPPERTAL, W-5600 WUPPERTAL 1, GERMANY
DOI
10.1207/s15327906mbr2704_3
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701; 070101;
Abstract
This article is concerned with procedures for determining the number of clusters in a data set. Most of the procedures or stopping rules currently in use involve finding internally coherent and externally isolated clusters, but do not derive from the formal structure of the respective clustering model. Based on the graph theoretic concepts of minimal spanning tree, maximal spanning tree, and homomorphic function, a new criterion is advanced that yields a well-defined clustering solution. Its performance in determining the number of clusters in several empirical data sets is evaluated by comparing it to four prominent stopping rules. It is shown that the proposed criterion not only possesses mathematically attractive properties but also may contribute to solving the number-of-clusters problem.
Pages: 541-565
Page count: 25