Performance of Rand's C statistics in clustering analysis: an application to clustering the regions of Turkey

被引:65
|
作者
Saracli, Sinan [1 ]
机构
[1] Afyon Kocatepe Univ, Dept Stat, Fac Arts & Sci, TR-03200 Afyon, Turkey
关键词
Rand's C statistics; hierarchical clustering methods; distance measures; HIERARCHICAL METHODS; FUNCTIONAL DATA; MONTE-CARLO;
D O I
10.1186/1029-242X-2013-142
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Purpose: When a clustering problem is encountered, the researcher must be aware that choosing an incorrect clustering method and distance measure may significantly affect the results of the analysis. The purpose of this study is to determine the best clustering method and distance measure in cluster analysis and to cluster the regions of Turkey on the basis of this result. Methods: In hierarchical clustering, there are several clustering methods and distance measures. For comparison of the clustering methods and distance measures, Rand's C statistic is one of the best methods. Rand's comparative statistic C takes on values from 0.0 to 1.0 inclusive that may be used to compare two resultant clusterings produced by applying clustering methods to a data set with unknown structure or to assess the performance of a clustering method on a data set with known structure. Results: In this study, the seven regions of Turkey are clustered by all the clustering methods and distance measures. Related with the social and economic indicators, the final cluster number is taken as three. Then, according to Rand's C statistics, all possible pairs of distance measures for all clustering methods in hierarchical clustering are compared, and the results are given in the related tables. Conclusions: According to the results of all possible comparisons, Ward's method is found to be the best among others, and final clustering of the regions is applied according to Ward's clustering measure.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] AN APPLICATION OF CLUSTERING FOR STRATEGIC GROUP-ANALYSIS
    HARRIGAN, KR
    STRATEGIC MANAGEMENT JOURNAL, 1985, 6 (01) : 55 - 73
  • [42] An approach on the application of clustering analysis to equipment evaluation
    Zhou, XC
    Jun, X
    Xiao, HL
    CONDITION MONITORING '99, PROCEEDINGS, 1999, : 105 - 111
  • [43] The Application of Fuzzy Clustering Analysis in Ore Classification
    Wang, Cuixiang
    MANUFACTURING PROCESS AND EQUIPMENT, PTS 1-4, 2013, 694-697 : 2860 - 2864
  • [45] Analysis of Fuzzy Clustering of Innovative Capability in Different Regions in China
    Zhong Shen
    Wang Chunyu
    Wang Fang
    ADVANCES IN MANAGEMENT OF TECHNOLOGY, PT 1, 2008, : 141 - 145
  • [46] A Novel Unascertained C-means Clustering with Application
    Shi, Huawang
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 134 - 137
  • [47] Performance Analysis of Clustering Based Genetic Algorithm
    Najeeb, Athaur Rahman
    Aibinu, A. M.
    Nwohu, M. N.
    Salami, M. J. E.
    Salau, H. Bello
    PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 327 - 331
  • [48] Performance analysis of 2-tier clustering
    Hwang, JunWon
    Song, DooHeon
    Lee, ChangHoon
    2006 INTERNATIONAL CONFERENCE ON HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2006, : 542 - 547
  • [49] Performance Analysis of Clustering Algorithms in Medical Datasets
    Premalatha, P.
    Subasree, S.
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [50] STUDENT PERFORMANCE ANALYSIS USING CLUSTERING ALGORITHM
    Singh, Ishwank
    Sabitha, A. Sai
    Bansal, Abhay
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 294 - 299