Determining the number of clusters by sampling with replacement

被引:21
|
作者
Tonidandel, S
Overall, JE
机构
[1] Davidson Coll, Dept Psychol, Davidson, NC 28035 USA
[2] Univ Texas, Hlth Sci Ctr, Dept Psychiat, Houston, TX USA
关键词
D O I
10.1037/1082-989X.9.2.238
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
A split-sample replication criterion originally proposed by J. E. Overall and K. N. Magee (1992) as a stopping rule for hierarchical cluster analysis is applied to multiple data sets generated by sampling with replacement from an original simulated primary data set. An investigation of the validity of this bootstrap procedure was undertaken using different combinations of the true number of latent populations, degrees of overlap, and sample sizes. The bootstrap procedure enhanced the accuracy of identifying the true number of latent populations under virtually all conditions. Increasing the size of the resampled data sets relative to the size of the primary data set further increased accuracy. A computer program to implement the bootstrap stopping rule is made available via a referenced Web site.
引用
收藏
页码:238 / 249
页数:12
相关论文
共 50 条
  • [1] Sampling and clustering algorithm for determining the number of clusters based on the rosette pattern
    Sadr, Ali
    Momtaz, Amirkeyvan
    OPTICAL ENGINEERING, 2012, 51 (01)
  • [2] Determining the number of clusters in cluster analysis
    My-Young Cheong
    Hakbae Lee
    Journal of the Korean Statistical Society, 2008, 37 : 135 - 143
  • [3] Deep Embedding for Determining the Number of Clusters
    Wang, Yiqi
    Shi, Zhan
    Guo, Xifeng
    Liu, Xinwang
    Zhu, En
    Yin, Jianping
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8173 - 8174
  • [4] Fuzzy Clustering: Determining the Number of Clusters
    Rezankova, Hana
    Husek, Dusan
    2012 FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ASPECTS OF SOCIAL NETWORKS (CASON), 2012, : 277 - 282
  • [5] Determining the number of clusters in cluster analysis
    Cheong, My-Young
    Lee, Hakbae
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2008, 37 (02) : 135 - 143
  • [6] A NEW APPROACH FOR DETERMINING NUMBER OF CLUSTERS
    Erisoglu, Murat
    Erisoglu, Ulku
    Servi, Tayfun
    Sakallioglu, Sadullah
    PAKISTAN JOURNAL OF STATISTICS, 2012, 28 (01): : 141 - 158
  • [7] DETERMINING THE OPTIMAL NUMBER OF CLUSTERS IN CLUSTER ANALYSIS
    Loster, Tomas
    10TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2016, : 1078 - 1090
  • [8] A Method for Automatically Determining The Number of Clusters of LAC
    Liu, Han
    Wu, Qingfeng
    Dong, Huailin
    Wang, Shuangshuang
    Cai, Qing
    Ma, Zhuo
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 1907 - +
  • [9] Determining the optimum number of increments in composite sampling
    Hathaway, John E.
    Schaalje, G. Bruce
    Gilbert, Richard O.
    Pulsipher, Brent A.
    Matzke, Brett D.
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2008, 15 (03) : 313 - 327
  • [10] Determining the optimum number of increments in composite sampling
    John E. Hathaway
    G. Bruce Schaalje
    Richard O. Gilbert
    Brent A. Pulsipher
    Brett D. Matzke
    Environmental and Ecological Statistics, 2008, 15 : 313 - 327