DenForest: Enabling Fast Deletion in Incremental Density-Based Clustering over Sliding Windows

被引:3
|
作者
Kim, Bogyeong [1 ]
Koo, Kyoseung [1 ]
Enkhbat, Undraa [1 ]
Moon, Bongki [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
DenForest; Density-Based Clustering; Data Stream; Sliding Window; POINT CLOUDS; DBSCAN; ALGORITHMS; SEARCH;
D O I
10.1145/3514221.3517833
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The density-based clustering is utilized for various applications such as hot spot detection or segmentation. To serve those applications in real time, it is desired to update clusters incrementally by capturing only the recent data. The previous incremental density-based clustering algorithms often represent clusters as a graph and suffer serious performance degradation. This is because a costly graph traversal is required to check whether a cluster is still connected whenever a point is removed. In order to address the problem of slow deletion, this paper proposes a novel incremental density-based clustering algorithm called DenForest. By maintaining clusters as a group of spanning trees instead of a graph, DenForest can determine efficiently and accurately whether a cluster is to be split by a point removed from the window in logarithmic time. With extensive evaluations, it is demonstrated that DenForest outperforms the state-of-the-art density-based clustering algorithms significantly and achieves the clustering quality comparable with that of DBSCAN.
引用
收藏
页码:296 / 309
页数:14
相关论文
共 50 条
  • [1] StreamSW: A density-based approach for clustering data streams over sliding windows
    Reddy, K. Shyam Sunder
    Bindu, C. Shoba
    [J]. MEASUREMENT, 2019, 144 : 14 - 19
  • [2] DISC: Density-Based Incremental Clustering by Striding over Streaming Data
    Kim, Bogyeong
    Koo, Kyoseung
    Kim, Juhun
    Moon, Bongki
    [J]. 2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 828 - 839
  • [3] Incremental density-based ensemble clustering over evolving data streams
    Khan, Imran
    Huang, Joshua Z.
    Ivanov, Kamen
    [J]. NEUROCOMPUTING, 2016, 191 : 34 - 43
  • [4] Incremental Density-Based Clustering on Multicore Processors
    Mai, Son T.
    Jacobsen, Jon
    Amer-Yahia, Sihem
    Spence, Ivor
    Nhat-Phuong Tran
    Assent, Ira
    Quoc Viet Hung Nguyen
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1338 - 1356
  • [5] A Fuzzy Density-based Incremental Clustering Algorithm
    Laohakiat, Sirisup
    Ratanajaipan, Photchanan
    Navaravong, Leenhapat
    Ungrangsi, Rachanee
    Maleewong, Krissada
    [J]. 2018 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2018, : 211 - 215
  • [6] Incremental grid density-based clustering algorithm
    Chen, Ning
    Chen, An
    Zhou, Long-Xiang
    [J]. Ruan Jian Xue Bao/Journal of Software, 2002, 13 (01): : 1 - 7
  • [7] Fast density-based clustering algorithm
    Zhou, Shuigeng
    Zhou, Aoying
    Cao, Jing
    Hu, Yunfa
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (11): : 1287 - 1292
  • [8] Incremental Shared Nearest Neighbor Density-Based Clustering
    Singh, Sumeet
    Awekar, Amit
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1533 - 1536
  • [9] An Incremental Density-Based Clustering Technique for Large Datasets
    Rehman, Saif Ur
    Khan, Muhammed Naeem Ahmed
    [J]. COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS 2010, 2010, 85 : 3 - 11
  • [10] Fast density estimation for density-based clustering methods
    Cheng, Difei
    Xu, Ruihang
    Zhang, Bo
    Jin, Ruinan
    [J]. NEUROCOMPUTING, 2023, 532 : 170 - 182