An examination of indexes for determining the number of clusters in binary data sets

被引:0
|
作者
Evgenia Dimitriadou
Sara Dolničar
Andreas Weingessel
机构
[1] Technische Universität Wien,Institut für Statistik und Wahrscheinlichkeitstheorie
[2] Wirtschaftsuniversität wien,Institut für Tourismus und Freizeitwirtschaft
来源
Psychometrika | 2002年 / 67卷
关键词
number of clusters; clustering indexes; binary data; artificial data sets; market segmentation;
D O I
暂无
中图分类号
学科分类号
摘要
The problem of choosing the correct number of clusters is as old as cluster analysis itself. A number of authors have suggested various indexes to facilitate this crucial decision. One of the most extensive comparative studies of indexes was conducted by Milligan and Cooper (1985). The present piece of work pursues the same goal under different conditions. In contrast to Milligan and Cooper's work, the emphasis here is on high-dimensional empirical binary data. Binary artificial data sets are constructed to reflect features typically encountered in real-world data situations in the field of marketing research. The simulation includes 162 binary data sets that are clustered by two different algorithms and lead to recommendations on the number of clusters for each index under consideration. Index results are evaluated and their performance is compared and analyzed.
引用
收藏
页码:137 / 159
页数:22
相关论文
共 50 条
  • [41] Trail-and-error approach for determining the number of clusters
    Sun, Haojun
    Sun, Mei
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 229 - 238
  • [42] Determining the number of clusters using the weighted gap statistic
    Yan, Mingjin
    Ye, Keying
    BIOMETRICS, 2007, 63 (04) : 1031 - 1037
  • [43] Curvature-based method for determining the number of clusters
    Zhang, Yaqian
    Mandziuk, Jacek
    Quek, Chai Hiok
    Goh, Boon Wooi
    INFORMATION SCIENCES, 2017, 415 : 414 - 428
  • [44] Determining the Correct Number of Clusters in the CT Image Segmentation
    Li, Qi
    Yue, Shihong
    Ding, Mingliang
    Li, Jia
    Wang, Zeying
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2020, 10 (11) : 2675 - 2680
  • [45] A new evolutionary algorithm for determining the optimal number of clusters
    Lu, Wei
    Traore, Issa
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 1, PROCEEDINGS, 2006, : 648 - +
  • [46] Application of Rosette Pattern for Clustering and Determining the Number of Clusters
    Sadr, Ali
    Momtaz, Amir Keyvan
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2011, 11 (03) : 77 - 84
  • [47] A Comparative Study of Determining the Number of Clusters with a Method Proposed
    Chae, Seong San
    Lim, Nam Kyoo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2005, 18 (02) : 329 - 341
  • [48] A Spectral Clustering Algorithm for Automatically Determining Clusters Number
    Chen, Bin
    Wang, Ya-lin
    Gong, Fan-ying
    Wang, Xiao-li
    Yang, Chun-hua
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 3723 - 3728
  • [49] A consistent procedure for determining the number of clusters in regression clustering
    Shao, Q
    Wu, Y
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 135 (02) : 461 - 476
  • [50] A method of dynamically determining the number of clusters and cluster centers
    Shao Xiongkai
    Pi Ling
    Liu Lianzhou
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 283 - 286