A theory of proximity based clustering: structure detection by optimization

被引:48
|
作者
Puzicha, J [1 ]
Hofmann, T
Buhmann, JM
机构
[1] Univ Bonn, Inst Informat 3, D-5300 Bonn, Germany
[2] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
clustering; proximity data; similarity; deterministic annealing; texture segmentation; document retrieval;
D O I
10.1016/S0031-3203(99)00076-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a systematic optimization approach for clustering proximity or similarity data is developed. Starting from Fundamental invariance and robustness properties, a set of axioms is proposed and discussed to distinguish different cluster compactness and separation criteria. The approach covers the case of sparse proximity matrices, and is extended to nested partitionings for hierarchical data clustering. To solve the associated optimization problems, a rigorous mathematical framework for deterministic annealing and mean-field approximation is presented. Efficient optimization heuristics are derived in a canonical way, which also clarifies the relation to stochastic optimization by Gibbs sampling. Similarity-based clustering techniques have a broad range of possible applications in computer vision, pattern recognition, and data analysis. As a major practical application we present a novel approach to the problem of unsupervised texture segmentation, which relies on statistical tests as a measure of homogeneity. The quality of the algorithms is empirically evaluated on a large collection of Brodatz-like micro-texture Mondrians and on a set of real-word images. To demonstrate the broad usefulness of the theory of proximity based clustering the performances of different criteria and algorithms are compared on an information retrieval task for a document database. The superiority of optimization algorithms for clustering is supported by extensive experiments. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:617 / 634
页数:18
相关论文
共 50 条
  • [1] Network structure and the optimization of proximity-based association criteria
    Gomes, Ana Cristina R.
    Boogert, Neeltje J.
    Cardoso, Goncalo C.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (01): : 88 - 100
  • [2] Optimization of heartbeat detection based on clustering and multimethod approach
    Sprager, Sebastijan
    Zazula, Damjan
    [J]. 2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 5 - 8
  • [3] Graph Clustering Based on Optimization of a Macroscopic Structure of Clusters
    Taniguchi, Yuta
    Ikeda, Daisuke
    [J]. DISCOVERY SCIENCE, 2011, 6926 : 335 - 350
  • [4] An Optimality Theory-Based Proximity Measure for Set-Based Multiobjective Optimization
    Deb, Kalyanmoy
    Abouhawwash, Mohamed
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2016, 20 (04) : 515 - 528
  • [5] A DC optimization-based clustering technique for edge detection
    Khalaf, W.
    Astorino, A.
    D'Alessandro, P.
    Gaudioso, M.
    [J]. OPTIMIZATION LETTERS, 2017, 11 (03) : 627 - 640
  • [6] A DC optimization-based clustering technique for edge detection
    W. Khalaf
    A. Astorino
    P. D’Alessandro
    M. Gaudioso
    [J]. Optimization Letters, 2017, 11 : 627 - 640
  • [7] Fuzzy relational clustering based on comparing two proximity matrices with utilization of particle swarm optimization
    Roelof K. Brouwer
    Albert Groenwold
    [J]. Soft Computing, 2009, 13 : 577 - 589
  • [8] Fuzzy relational clustering based on comparing two proximity matrices with utilization of particle swarm optimization
    Brouwer, Roelof K.
    Groenwold, Albert
    [J]. SOFT COMPUTING, 2009, 13 (06) : 577 - 589
  • [9] A method of proximity matrix based fuzzy clustering
    Brouwer, Roelof K.
    Groenwold, Albert
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 91 - +
  • [10] A proximity approach to DNA based clustering analysis
    Abu Bakar, Rohani Binti
    Watada, Junzo
    Pedrycz, Witold
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (05): : 1203 - 1212