Statistical Comparative Analysis and Evaluation of Validation Indices for Clustering Optimization

被引:0
|
作者
Nguyen, Thy [1 ]
Vichman, Jason [2 ]
Yeboah, Dacosta [1 ]
Olbricht, Gayla R. [2 ]
Obafemi-Ajayi, Tayo [3 ]
机构
[1] Missouri State Univ, Comp Sci Dept, Springfield, MO 65897 USA
[2] Missouri Univ Sci & Technol, Math & Stat Dept, Rolla, MO 65409 USA
[3] Missouri State Univ, Engn Program, Springfield, MO USA
关键词
clustering; validation indices; statistical analysis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a relevant exploratory tool for a broad range of machine learning applications as it aids identification of meaningful subgroups. For a given clustering algorithm, multiple partitions can be obtained on the same data set by varying algorithmic parameters. Internal validation indices provide a means to objectively evaluate how well groupings obtained from a clustering configuration partitions the data, since there is no prior labeled data. This work presents a rigorous statistical evaluation framework that analyzes performance of internal validation indices based on correlation with external indices. A synthetic data generator that captures a wide range of complexity is proposed. Evaluation is conducted on a varied set of synthetic data types and real data sets to investigate performance of the indices.
引用
收藏
页码:3081 / 3090
页数:10
相关论文
共 50 条
  • [1] Validation indices for graph clustering
    Günter, S
    Bunke, H
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (08) : 1107 - 1113
  • [2] Validation indices for projective clustering
    Lifei Chen
    Shanjun He
    Qingshan Jiang
    [J]. Frontiers of Computer Science in China, 2009, 3 : 477 - 484
  • [3] Validation indices for projective clustering
    Chen, Lifei
    He, Shanjun
    Jiang, Qingshan
    [J]. FRONTIERS OF COMPUTER SCIENCE IN CHINA, 2009, 3 (04): : 477 - 484
  • [4] Analysis of genetic association using hierarchical clustering and cluster validation indices
    Pagnuco, Inti A.
    Pastore, Juan I.
    Abras, Guillermo
    Brun, Marcel
    Ballarin, Virginia L.
    [J]. GENOMICS, 2017, 109 (5-6) : 438 - 445
  • [5] A Comparative Study of Clustering Validation Indices and Maximum Entropy for Sintonization of Automatic Segmentation Techniques
    Hernandez, J.
    Marin, H.
    Tello, E.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2019, 17 (08) : 1229 - 1236
  • [6] On fuzzification and optimization problems of clustering indices
    Sope, Devi Rahmah
    Mitsuhiko, Fujio
    [J]. INTERNATIONAL JOURNAL OF MATHEMATICS FOR INDUSTRY, 2023, 15 (01):
  • [7] Comparison of internal clustering validation indices for prototype-based clustering
    Hämäläinen, Joonas
    Jauhiainen, Susanne
    Kärkkäinen, Tommi
    [J]. Algorithms, 2017, 10 (03):
  • [8] Systematic statistical comparison of comparative molecular similarity indices analysis molecular fields for computer-aided lead optimization
    Dias, Mafalda M.
    Mittal, Ruchi R.
    McKinnon, Ross A.
    Sorich, Michael J.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (05) : 2015 - 2021
  • [9] Performance Evaluation of Some Clustering Indices
    Roy, Parthajit
    Mandal, J. K.
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
  • [10] Quality indices for (practical) clustering evaluation
    Cardoso, Margarida G. M. S.
    de Carvalho, Andre Ponce de Leon F.
    [J]. INTELLIGENT DATA ANALYSIS, 2009, 13 (05) : 725 - 740