NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data

Cited by: 2
Authors
Lulli, Alessandro [1 ,2 ]
Dell'Amico, Matteo [3 ]
Michiardi, Pietro [4 ]
Ricci, Laura [1 ,2 ]
Affiliations
[1] Univ Pisa, I-56100 Pisa, Italy
[2] CNR, ISTI, Pisa, Italy
[3] Symantec Res Labs, Paris, France
[4] EURECOM, Campus SophiaTech, Biot, France
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016 / Vol. 10 / No. 03
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We present NG-DBSCAN, an approximate density-based clustering algorithm that operates on arbitrary data and any symmetric distance measure. The distributed design of our algorithm makes it scalable to very large datasets; its approximate nature makes it fast, yet capable of producing high quality clustering results. We provide a detailed overview of the steps of NG-DBSCAN, together with their analysis. Our results, obtained through an extensive experimental campaign with real and synthetic data, substantiate our claims about NG-DBSCAN's performance and scalability.
Pages: 157 - 168
Page count: 12
Related papers
50 in total
  • [21] PPA-DBSCAN: Privacy-Preserving ρ-Approximate Density-Based Clustering
    Fu, Jiaxuan
    Cheng, Ke
    Chang, Zhao
    Shen, Yulong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (06) : 5324 - 5340
  • [22] Comparative Analysis Review of Pioneering DBSCAN and Successive Density-Based Clustering Algorithms
    Bushra, Adil Abdu
    Yi, Gangman
    IEEE ACCESS, 2021, 9 : 87918 - 87935
  • [23] HiClus: Highly Scalable Density-based Clustering with Heterogeneous Cloud
    Chen, Chun-Chieh
    Chen, Ming-Syan
    INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 149 - 157
  • [24] ADvaNCE - Efficient and Scalable Approximate Density-Based Clustering Based on Hashing
    Li, Tianrun
    Heinis, Thomas
    Luk, Wayne
    INFORMATICA, 2017, 28 (01) : 105 - 130
  • [25] An Efficient Density-Based Algorithm for Data Clustering
    Theljani, Foued
    Laabidi, Kaouther
    Zidi, Salah
    Ksouri, Moufida
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (04)
  • [26] Anytime density-based clustering of complex data
    Son T. Mai
    Xiao He
    Jing Feng
    Claudia Plant
    Christian Böhm
    Knowledge and Information Systems, 2015, 45 : 319 - 355
  • [27] Geometric algorithms for density-based data clustering
    Chen, DZ
    Smid, M
    Xu, B
    ALGORITHMS-ESA 2002, PROCEEDINGS, 2002, 2461 : 284 - 296
  • [28] Density-based clustering for exploration of analytical data
    Daszykowski, M
    Walczak, B
    Massart, DL
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2004, 380 (03) : 370 - 372
  • [29] Share density-based clustering of income data
    Condino, Francesca
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (04) : 336 - 347
  • [30] Geometric algorithms for density-based data clustering
    Chen, DZ
    Smid, M
    Xu, B
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2005, 15 (03) : 239 - 260