Classification-Based Clustering Evaluation

Authors
Whissell, John S. [1]
Clarke, Charles L. A. [1]
Affiliations
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
Keywords
clustering methods
DOI
10.1109/ICDM.2013.28
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The evaluation of clustering quality has proven to be a difficult task. While it is generally agreed that application-specific human assessment can provide a reasonable gold standard for clustering evaluation, the use of human assessors is not practical in many real situations. As a result, machine-computable internal clustering quality measures (CQMs) are often used in the evaluation process. However, CQMs have their own drawbacks. Despite their extensive use in clustering research and applications, many CQMs have been shown to lack generality. In this paper, we present a new CQM with general applicability. The basis of our CQM is a pattern recognition view of clustering's purpose: the unsupervised prediction of behavior from populations. This purpose translates naturally into our new classifier-based CQM, which we refer to as informativeness. We show that informativeness can satisfy core CQM axioms defined in prior research. Additionally, we provide experimental support, showing that informativeness can outperform many established CQMs by detecting a larger variety of meaningful structures across a range of synthetic datasets, while at the same time exhibiting good performance on each individual dataset. Our results indicate that informativeness provides a highly general and effective CQM.
Pages: 1229-1234
Page count: 6