Stable and Consistent Density-Based Clustering via Multiparameter Persistence

被引:0
|
作者
Rolle, Alexander [1 ]
Scoccola, Luis [2 ]
机构
[1] Tech Univ Munich, Dept Math, Boltzmannstr 3, D-85748 Garching, Germany
[2] Univ Oxford, Math Inst, Woodstock Rd, Oxford OX2 6GG, England
基金
英国工程与自然科学研究理事会; 美国国家科学基金会; 奥地利科学基金会;
关键词
density-based clustering; topological data analysis; hierarchical clustering; multiparameter persistent homology; interleaving distance; vineyard; SINGLE LINKAGE; STABILITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the degree-Rips construction from topological data analysis, which provides a density-sensitive, multiparameter hierarchical clustering algorithm. We analyze its stability to perturbations of the input data using the correspondence-interleaving distance, a metric for hierarchical clusterings that we introduce. Taking certain one-parameter slices of degree-Rips recovers well-known methods for density-based clustering, but we show that these methods are unstable. However, we prove that degree-Rips, as a multiparameter object, is stable, and we propose an alternative approach for taking slices of degree-Rips, which yields a one-parameter hierarchical clustering algorithm with better stability properties. We prove that this algorithm is consistent, using the correspondence-interleaving distance. We provide an algorithm for extracting a single clustering from one-parameter hierarchical clusterings, which is stable with respect to the correspondence-interleaving distance. And, we integrate these methods into a pipeline for density-based clustering, which we call Persistable. Adapting tools from multiparameter persistent homology, we propose visualization tools that guide the selection of all parameters of the pipeline. We demonstrate Persistable on benchmark data sets, showing that it identifies multi-scale cluster structure in data.
引用
收藏
页数:74
相关论文
共 50 条
  • [1] Density-based clustering
    Campello, Ricardo J. G. B.
    Kroeger, Peer
    Sander, Jorg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (02)
  • [2] Density-based clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Sander, Joerg
    Zimek, Arthur
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (03) : 231 - 240
  • [3] Energy replenishment optimisation via density-based clustering
    Gu, Xin
    Peng, Jun
    Cheng, Yijun
    Zhang, Xiaoyong
    Liu, Kaiyang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 21 (02) : 271 - 280
  • [4] Mining Stable Communities in Temporal Networks by Density-Based Clustering
    Qin, Hongchao
    Li, Rong-Hua
    Wang, Guoren
    Huang, Xin
    Yuan, Ye
    Yu, Jeffrey Xu
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (03) : 671 - 684
  • [5] Density-Based Clustering with Constraints
    Lasek, Piotr
    Gryz, Jarek
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2019, 16 (02) : 469 - 489
  • [6] Density-Based Clustering of Polygons
    Joshi, Deepti
    Samal, Ashok K.
    Soh, Leen-Kiat
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 171 - 178
  • [7] Directional density-based clustering
    Saavedra-Nieves, Paula
    Fernandez-Perez, Martin
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024,
  • [8] Active Density-Based Clustering
    Mai, Son T.
    He, Xiao
    Hubig, Nina
    Plant, Claudia
    Boehm, Christian
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 508 - 517
  • [9] Fast Parameterless Density-Based Clustering via Random Projections
    Schneider, Johannes
    Vlachos, Michail
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 861 - 866
  • [10] Stability of Density-Based Clustering
    Rinaldo, Alessandro
    Singh, Aarti
    Nugent, Rebecca
    Wasserman, Larry
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 905 - 948