RNN-DBSCAN: A Density-Based Clustering Algorithm Using Reverse Nearest Neighbor Density Estimates

Cited by: 195
Authors
Bryant, Avory [1, 2]
Cios, Krzysztof [3, 4]
Affiliations
[1] Virginia Commonwealth Univ, Dept Comp Sci, Med Coll Virginia Campus, Richmond, VA 23284 USA
[2] Naval Surface Warfare Ctr Dahlgren Div, Dahlgren, VA 22448 USA
[3] Virginia Commonwealth Univ, Comp Sci Dept, Med Coll Virginia Campus, Richmond, VA 23284 USA
[4] Polish Acad Sci, PL-20290 Lublin, Poland
Keywords
Unsupervised learning; pattern analysis; clustering algorithms; pattern clustering; density estimation robust algorithm; nearest neighbor searches
DOI
10.1109/TKDE.2017.2787640
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A new density-based clustering algorithm, RNN-DBSCAN, is presented that uses reverse nearest neighbor counts as an estimate of observation density. Clustering is performed using a DBSCAN-like approach based on k nearest neighbor graph traversals through dense observations. RNN-DBSCAN is preferable to the popular density-based clustering algorithm DBSCAN in two respects. First, problem complexity is reduced to a single parameter (the choice of k nearest neighbors). Second, it has an improved ability to handle large variations in cluster density (heterogeneous density). The superiority of RNN-DBSCAN is demonstrated on several artificial and real-world datasets with respect to prior work on reverse nearest neighbor based clustering approaches (RECORD, IS-DBSCAN, and ISB-DBSCAN), along with DBSCAN and OPTICS. Each of these clustering approaches is described by a common graph-based interpretation wherein clusters of dense observations are defined as connected components, along with a discussion of their computational complexity. Heuristics for RNN-DBSCAN parameter selection are presented, and the effects of k on RNN-DBSCAN clusterings are discussed. Additionally, with respect to scalability, an approximate version of RNN-DBSCAN is presented that leverages an existing approximate k nearest neighbor technique.
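The core idea in the abstract (reverse nearest neighbor counts as a density estimate, then a DBSCAN-like traversal of the k nearest neighbor graph through dense observations) can be illustrated with a minimal sketch. This is not the authors' algorithm as published: the function names are hypothetical, the rule "an observation is dense when it has at least k reverse neighbors" is an assumption made for illustration, neighbors are found by brute force, and any refinements the paper applies to border or noise observations are omitted.

```python
from collections import deque
from math import dist


def knn_lists(points, k):
    """Brute-force k nearest neighbors of each point (the point itself excluded)."""
    n = len(points)
    neighbors = []
    for i in range(n):
        order = sorted((j for j in range(n) if j != i),
                       key=lambda j: dist(points[i], points[j]))
        neighbors.append(order[:k])
    return neighbors


def reverse_counts(neighbors):
    """Count, for each point, how many other points list it among their k nearest."""
    counts = [0] * len(neighbors)
    for nbrs in neighbors:
        for j in nbrs:
            counts[j] += 1
    return counts


def rnn_cluster_sketch(points, k):
    """Label points by traversing kNN edges outward from dense points only.

    Returns one label per point; -1 marks points left unassigned (noise).
    """
    neighbors = knn_lists(points, k)
    rcount = reverse_counts(neighbors)
    # Assumption for this sketch: "dense" means at least k reverse neighbors.
    dense = [c >= k for c in rcount]

    labels = [-1] * len(points)
    cluster = 0
    for seed in range(len(points)):
        if labels[seed] != -1 or not dense[seed]:
            continue
        labels[seed] = cluster
        queue = deque([seed])
        while queue:
            i = queue.popleft()
            for j in neighbors[i]:
                if labels[j] == -1:
                    labels[j] = cluster
                    # Only dense points are expanded further; a non-dense
                    # point joins the cluster but does not extend it.
                    if dense[j]:
                        queue.append(j)
        cluster += 1
    return labels
```

Calling `rnn_cluster_sketch(points, k)` on a list of coordinate tuples returns one integer label per point. The only tunable input is `k`, which mirrors the single-parameter property the abstract emphasizes; the brute-force neighbor search is quadratic and would be replaced by an index or an approximate method for large datasets.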
Pages: 1109-1121
Number of pages: 13
Related papers
  • [31] Reverse-Nearest-Neighbor-Based Clustering by Fast Search and Find of Density Peaks
    Zhang, Chunhao
    Xie, Bin
    Zhang, Yiran
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (06) : 1341 - 1354
  • [33] Density peaks clustering algorithm with nearest neighbor optimization for data with uneven density distribution
    Chen W.-C.
    Zhao J.
    Xiao R.-B.
    Wang H.
    Cui Z.-H.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (03) : 919 - 928
  • [34] Constrained Density-Based Spatial Clustering of Applications with Noise (DBSCAN) using hyperparameter optimization
    Kim, Jongwon
    Lee, Hyeseon
    Ko, Young Myoung
    KNOWLEDGE-BASED SYSTEMS, 2024, 303
  • [35] An Algorithm to Adaptive Determination of Density Threshold for Density-based Clustering
    Ke, Zhang
    Lei, Huang
    Yi, Chai
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016 : 3929 - 3935
  • [36] A NEW K-NEAREST NEIGHBOR DENSITY-BASED CLUSTERING METHOD AND ITS APPLICATION TO HYPERSPECTRAL IMAGES
    Cariou, Claude
    Chehdi, Kacem
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016 : 6161 - 6164
  • [37] DBSCAN-MS: Distributed Density-Based Clustering in Metric Spaces
    Yang, Keyu
    Gao, Yunjun
    Ma, Rui
    Chen, Lu
    Wu, Sai
    Chen, Gang
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019 : 1346 - 1357
  • [38] A Density-based clustering algorithm suitable to various density dataset
    School of Software, Dalian University of Technology, Dalian 116621, China
    J. Comput. Inf. Syst., 2008, 6 (2473-2481):
  • [39] NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data
    Lulli, Alessandro
    Dell'Amico, Matteo
    Michiardi, Pietro
    Ricci, Laura
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 10 (03) : 157 - 168
  • [40] DBSCAN-MO: Density-Based Clustering among Moving Obstacles
    Stefanakis, Emmanuel
    EUROPEAN INFORMATION SOCIETY: TAKING GEOINFORMATION SCIENCE ONE STEP FURTHER, 2009 : 159 - 179