NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data

被引:2
|
作者
Lulli, Alessandro [1 ,2 ]
Dell'Amico, Matteo [3 ]
Michiardi, Pietro [4 ]
Ricci, Laura [1 ,2 ]
机构
[1] Univ Pisa, I-56100 Pisa, Italy
[2] CNR, ISTI, Pisa, Italy
[3] Symantec Res Labs, Paris, France
[4] EURECOM, Campus SophiaTech, Biot, France
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016年 / 10卷 / 03期
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present NG-DBSCAN, an approximate density-based clustering algorithm that operates on arbitrary data and any symmetric distance measure. The distributed design of our algorithm makes it scalable to very large datasets; its approximate nature makes it fast, yet capable of producing high quality clustering results. We provide a detailed overview of the steps of NG-DBSCAN, together with their analysis. Our results, obtained through an extensive experimental campaign with real and synthetic data, substantiate our claims about NG-DBSCAN's performance and scalability.
引用
收藏
页码:157 / 168
页数:12
相关论文
共 50 条
  • [1] dbscan: Fast Density-Based Clustering with R
    Hahsler, Michael
    Piekenbrock, Matthew
    Doran, Derek
    JOURNAL OF STATISTICAL SOFTWARE, 2019, 91 (01): : 1 - 30
  • [2] C-DBSCAN: Density-based clustering with constraints
    Ruiz, Carlos
    Spiliopoulou, Myra
    Menasalvas, Ernestina
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2007, 4482 : 216 - +
  • [3] Scalable density-based distributed clustering
    Januzaj, E
    Kriegel, HP
    Pfeifle, M
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 231 - 244
  • [4] KM-DBSCAN: Density-Based Clustering of Massive Spatial Data with Keywords
    Jang, Hong-Jun
    Kim, Byoungwook
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2021, 11
  • [5] ADAPTIVE DENSITY-BASED SPATIAL CLUSTERING OF APPLICATIONS WITH NOISE (DBSCAN) ACCORDING TO DATA
    Wang, Wei-Tung
    Wu, Yi-Leh
    Tang, Cheng-Yuan
    Hor, Maw-Kae
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL. 1, 2015, : 445 - 451
  • [6] MDST-DBSCAN: A Density-Based Clustering Method for Multidimensional Spatiotemporal Data
    Choi, Changlock
    Hong, Seong-Yun
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (06)
  • [7] An Efficient And Scalable Density-Based Clustering Algorithm For Normalize Data
    Nidhi
    Patel, Km Archana
    2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, COMMUNICATION & CONVERGENCE, ICCC 2016, 2016, 92 : 136 - 141
  • [8] Significant DBSCAN plus : Statistically Robust Density-based Clustering
    Xie, Yiqun
    Jia, Xiaowei
    Shekhar, Shashi
    Bao, Han
    Zhou, Xun
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (05)
  • [9] FEM-DBSCAN: An Efficient Density-Based Clustering Approach
    Uranus Kazemi
    Reza Boostani
    Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2021, 45 : 979 - 992
  • [10] Scalable local density-based distributed clustering
    Liu Yan-bing
    Liu Zhang-xiong
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9491 - 9498