Persistence-Based Clustering in Riemannian Manifolds

被引:0
|
作者
Chazal, Frederic [1 ]
Oudot, Steve [1 ]
Skraba, Primoz [1 ]
Guibas, Leonidas J.
机构
[1] INRIA Saclay Ile de France, F-91893 Orsay, France
基金
美国国家科学基金会;
关键词
Unsupervised Learning; Clustering; Mode Seeking; Topological Persistence; Morse Theory;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a clustering scheme that combines a mode-seeking phase with a cluster merging phase in the corresponding density map. While mode detection is done by a standard graph-based hill-climbing scheme, the novelty of our approach resides in its use of topological persistence to guide the merging of clusters. Our algorithm provides additional feedback in the form of a set of points in the plane, called a persistence diagram (PD), which provably reflects the prominences of the modes of the density. In practice, this feedback enables the user to choose relevant parameter values, so that under mild sampling conditions the algorithm will output the correct number of clusters, a notion that can be made formally sound within persistence theory. The algorithm only requires rough estimates of the density at the data points, and knowledge of (approximate) pairwise distances between them. It is therefore applicable in any metric space. Meanwhile, its complexity remains practical: although the size of the input distance matrix may be up to quadratic in the number of data points, a careful implementation only uses a linear amount of memory and takes barely more time to run than to read through the input. In this conference version of the paper we emphasize the experimental aspects of our work, describing the approach, giving an intuitive overview of its theoretical guarantees, discussing the choice of its parameters in practice, and demonstrating its potential in terms of applications through a series of experimental results obtained on synthetic and real-life data sets. Precise statements and proofs of our theoretical claims can be found in the full version of the paper [7].
引用
收藏
页码:97 / 106
页数:10
相关论文
共 50 条
  • [1] Persistence-Based Clustering in Riemannian Manifolds
    Chazal, Frederic
    Guibas, Leonidas J.
    Oudot, Steve Y.
    Skraba, Primoz
    [J]. JOURNAL OF THE ACM, 2013, 60 (06)
  • [2] Persistence-based clustering with outlier-removing filtration
    Bois, Alexandre
    Tervil, Brian
    Oudre, Laurent
    [J]. FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2024, 10
  • [3] Persistence-based Structural Recognition
    Li, Chunyuan
    Ovsjanikov, Maks
    Chazal, Frederic
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2003 - 2010
  • [4] Clustering and dimensionality reduction on Riemannian manifolds
    Goh, Alvina
    Vidal, Rene
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 626 - 632
  • [5] The evolution of biogeochemical recycling by persistence-based selection
    Boyle, Richard A.
    Lenton, Timothy M.
    [J]. COMMUNICATIONS EARTH & ENVIRONMENT, 2022, 3 (01):
  • [6] Towards Persistence-Based Reconstruction in Euclidean Spaces
    Chazal, Frederic
    Oudot, Steve Y.
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH ANNUAL SYMPOSIUM ON COMPUTATIONAL GEOMETRY (SGG'08), 2008, : 232 - 241
  • [7] Persistence-Based Pooling for Shape Pose Recognition
    Bonis, Thomas
    Ovsjanikov, Maks
    Oudot, Steve
    Chazal, Frederic
    [J]. COMPUTATIONAL TOPOLOGY IN IMAGE CONTEXT, CTIC 2016, 2016, 9667 : 19 - 29
  • [8] The evolution of biogeochemical recycling by persistence-based selection
    Richard A. Boyle
    Timothy M. Lenton
    [J]. Communications Earth & Environment, 3
  • [9] Persistence-Based Branch Misprediction Bounds for WCET Analysis
    Puffitsch, Wolfgang
    [J]. 30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 1898 - 1905
  • [10] Persistence-based temporal filtering for MODIS snow products
    Morriss, B. F.
    Ochs, E.
    Deeb, E. J.
    Newman, S. D.
    Daly, S. F.
    Gagnon, J. J.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2016, 175 : 130 - 137