Performance of Rand's C statistics in clustering analysis: an application to clustering the regions of Turkey

Cited by: 65
Authors
Saracli, Sinan [1]
Affiliation
[1] Afyon Kocatepe Univ, Dept Stat, Fac Arts & Sci, TR-03200 Afyon, Turkey
Keywords
Rand's C statistics; hierarchical clustering methods; distance measures; HIERARCHICAL METHODS; FUNCTIONAL DATA; MONTE-CARLO;
DOI
10.1186/1029-242X-2013-142
Chinese Library Classification
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Purpose: In any clustering problem, the researcher must be aware that an unsuitable choice of clustering method or distance measure may significantly affect the results of the analysis. The purpose of this study is to determine the best clustering method and distance measure for cluster analysis and to cluster the regions of Turkey on the basis of this result. Methods: Hierarchical clustering offers several clustering methods and distance measures. Rand's C statistic is one of the best tools for comparing them. Rand's comparative statistic C takes values from 0.0 to 1.0 inclusive and may be used either to compare two clusterings produced by applying clustering methods to a data set with unknown structure, or to assess the performance of a clustering method on a data set with known structure. Results: In this study, the seven regions of Turkey are clustered using every combination of clustering method and distance measure considered. Based on social and economic indicators, the final number of clusters is set to three. Then, using Rand's C statistic, all possible pairs of distance measures are compared for each hierarchical clustering method, and the results are reported in the corresponding tables. Conclusions: According to the results of all possible comparisons, Ward's method is found to be the best of the methods compared, and the final clustering of the regions is carried out using Ward's method.
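As an illustration of the statistic described in the abstract, the following is a minimal sketch of the standard pair-counting form of the Rand index: the fraction of object pairs on which two clusterings agree (both place the pair in the same cluster, or both separate it). The function name `rand_index` and the example label vectors are hypothetical and are not taken from the paper; in particular, the seven labels below are made-up data, not the paper's clustering of the regions of Turkey.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Return the fraction of object pairs on which two clusterings agree.

    A pair counts as an agreement when both clusterings put the two
    objects in the same cluster, or both put them in different clusters.
    The result lies in [0.0, 1.0]; 1.0 means identical partitions.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("both clusterings must label the same objects")
    n = len(labels_a)
    if n < 2:
        raise ValueError("at least two objects are needed to form a pair")

    agreements = 0
    total_pairs = 0
    for i, j in combinations(range(n), 2):
        same_in_a = labels_a[i] == labels_a[j]
        same_in_b = labels_b[i] == labels_b[j]
        if same_in_a == same_in_b:
            agreements += 1
        total_pairs += 1
    return agreements / total_pairs


# Hypothetical example: seven objects, each assigned to one of three clusters
# by two different (unnamed) method and distance-measure combinations.
clustering_1 = [0, 0, 1, 1, 2, 2, 2]
clustering_2 = [0, 0, 1, 2, 2, 2, 2]

c_value = rand_index(clustering_1, clustering_2)
print(round(c_value, 4))  # 17 agreeing pairs out of 21 -> 0.8095
```

Because only pair co-membership is compared, the value does not depend on how the clusters are numbered, so two clusterings that differ only by a relabeling score 1.0.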
Pages: 9