GB-DBSCAN: A fast granular-ball based DBSCAN clustering algorithm

Cited by: 4
Authors
Cheng, Dongdong [1 ,2 ,3 ,4 ]
Zhang, Cheng [2 ,3 ,4 ]
Li, Ya [2 ,3 ,4 ]
Xia, Shuyin [2 ,3 ,4 ]
Wang, Guoyin [2 ,3 ,4 ]
Huang, Jinlong [1 ]
Zhang, Sulan [1 ]
Xie, Jiang [2 ,3 ,4 ]
Affiliations
[1] Yangtze Normal Univ, Coll Big Data & Intelligent Engn, Chongqing 408100, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Comp Intelligence, Chongqing 400065, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Key Lab Cyberspace Big Data Intelligent Secur, Minist Educ, Chongqing 400065, Peoples R China
[4] Chongqing Univ Posts & Telecommun, Key Lab Big Data Intelligent Comp, Chongqing 400065, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DBSCAN; Granular-ball; KNN; Clustering;
DOI
10.1016/j.ins.2024.120731
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Density-Based Spatial Clustering of Applications with Noise (DBSCAN) identifies high-density connected regions as clusters, which makes it well suited to discovering clusters of arbitrary shape. However, its parameters are difficult to tune, and because it must scan every data point in turn, its time complexity is O(n²). A granular-ball (GB) is a coarse-grained representation of data. It rests on the assumption that an object and its local neighbors have similar distributions and are therefore likely to belong to the same class. Xia et al. introduced granular-balls into supervised learning to improve its efficiency. Inspired by this idea, we introduce granular-balls into unsupervised learning and use them to improve the efficiency of DBSCAN; we call the resulting algorithm GB-DBSCAN. Its main idea is to represent sets of data points by granular-balls and then cluster the granular-balls instead of the individual data points. First, we use k-nearest neighbors (KNN) to generate granular-balls, which is a bottom-up strategy, and describe each granular-ball by its center and radius. Then, the granular-balls are divided into Core-GBs and Non-Core-GBs according to their density. After that, the Core-GBs are merged into clusters following the idea of DBSCAN, and the Non-Core-GBs are assigned to the appropriate clusters. Since the number of granular-balls is much smaller than the number of objects in a dataset, the running time of DBSCAN is greatly reduced. In comparisons with the KNN-BLOCK DBSCAN, RNN-DBSCAN, DBSCAN, K-means, DP, and SNN-DPC algorithms, the proposed algorithm obtains similar or even better clustering results in much less running time.
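The pipeline described in the abstract (build balls with KNN, split them into core and non-core by density, merge core balls, attach non-core balls) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the ball-generation rule, the density measure, or the merge criterion, so everything concrete here is an assumption made for illustration. In particular, the function names `build_balls` and `gb_dbscan_sketch`, the greedy grouping of each seed with its `k` nearest unassigned points, the density defined as members per unit radius, the threshold `min_density`, and the surface-gap tolerance `eps` are all hypothetical.

```python
import numpy as np


def build_balls(X, k):
    """Greedy bottom-up ball generation (assumed rule, not the paper's).

    Each unassigned point, in index order, is grouped with its k nearest
    unassigned neighbours. A ball is described by its center and radius.
    """
    unassigned = np.ones(len(X), dtype=bool)
    balls = []  # list of (center, radius, member_indices)
    for i in range(len(X)):
        if not unassigned[i]:
            continue
        idx = np.flatnonzero(unassigned)
        d = np.linalg.norm(X[idx] - X[i], axis=1)
        # Stable sort keeps the seed itself first among zero-distance ties.
        members = idx[np.argsort(d, kind="stable")[: k + 1]]
        unassigned[members] = False
        center = X[members].mean(axis=0)
        radius = np.linalg.norm(X[members] - center, axis=1).max()
        balls.append((center, radius, members))
    return balls


def gb_dbscan_sketch(X, k=2, min_density=2.0, eps=1.5):
    """Cluster the balls instead of the points; return one label per point."""
    X = np.asarray(X, dtype=float)
    balls = build_balls(X, k)
    centers = np.array([c for c, _, _ in balls])
    radii = np.array([r for _, r, _ in balls])
    sizes = np.array([len(m) for _, _, m in balls])

    # Assumed density: members per unit radius. Undersized leftover balls
    # (fewer than k + 1 points) are treated as non-core.
    density = sizes / np.maximum(radii, 1e-12)
    core = (density >= min_density) & (sizes == k + 1)

    # Gap between ball surfaces; negative values mean the balls overlap.
    dist = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=2)
    gap = dist - radii[:, None] - radii[None, :]

    # DBSCAN-style expansion over core balls whose surfaces are within eps.
    ball_label = np.full(len(balls), -1)
    next_label = 0
    for s in np.flatnonzero(core):
        if ball_label[s] != -1:
            continue
        ball_label[s] = next_label
        stack = [s]
        while stack:
            b = stack.pop()
            nbrs = np.flatnonzero(core & (gap[b] <= eps) & (ball_label == -1))
            ball_label[nbrs] = next_label
            stack.extend(nbrs.tolist())
        next_label += 1

    # Each non-core ball joins the cluster of the nearest core ball (by center).
    core_idx = np.flatnonzero(core)
    if len(core_idx) > 0:
        for b in np.flatnonzero(~core):
            nearest = core_idx[np.argmin(dist[b, core_idx])]
            ball_label[b] = ball_label[nearest]

    # Propagate ball labels to the member points.
    labels = np.full(len(X), -1)
    for lab, (_, _, members) in zip(ball_label, balls):
        labels[members] = lab
    return labels
```

Calling `gb_dbscan_sketch(X, k, min_density, eps)` on an `(n, d)` array returns one integer label per point. The expensive pairwise step runs over the balls, of which there are roughly `n / (k + 1)`, which is the source of the speed-up the abstract describes; the greedy ball construction in this sketch is itself quadratic and would need a spatial index to scale.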
Pages: 17
Related papers
50 items in total
  • [31] Tra-DBScan: a Algorithm of Clustering Trajectories
    Liu, Liangxu
    Song, Jiatao
    Guan, Bo
    Wu, Zhaoxiao
    He, Kejia
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE II, PTS 1-6, 2012, 121-126 : 4875 - 4879
  • [32] The Parameter Configuration Method of DBSCAN Clustering Algorithm
    Song, Jin-yu
    Guo, Yi-ping
    Wang, Bin
    2018 5TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2018, : 1062 - 1070
  • [33] Weather forecasting using DBSCAN clustering algorithm
    Chefrour, Aida
    ANNALES MATHEMATICAE ET INFORMATICAE, 2022, 55 : 12 - 27
  • [34] A fast DBSCAN clustering algorithm by accelerating neighbor searching using Groups method
    Kumar, K. Mahesh
    Reddy, A. Rama Mohan
    PATTERN RECOGNITION, 2016, 58 : 39 - 48
  • [35] Dynamic DBSCAN-GM Clustering Algorithm
    Smiti, Abir
    Elouedi, Zied
    2015 16TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2015, : 311 - 316
  • [36] l-DBSCAN: A fast hybrid density based clustering method
    Viswanath, P.
    Pinkesh, Rajwala
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 912 - +
  • [37] STRP-DBSCAN: A Parallel DBSCAN Algorithm Based on Spatial-Temporal Random Partitioning for Clustering Trajectory Data
    An, Xiaoya
    Wang, Ziming
    Wang, Ding
    Liu, Song
    Jin, Cheng
    Xu, Xinpeng
    Cao, Jianjun
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [38] Clustering and application of grain temperature statistical parameters based on the DBSCAN algorithm
    Cui, Hongwei
    Wu, Wenfu
    Zhang, Zhongjie
    Han, Feng
    Liu, Zhe
    JOURNAL OF STORED PRODUCTS RESEARCH, 2021, 93
  • [39] Identification of Convective and Stratiform Clouds Based on the Improved DBSCAN Clustering Algorithm
    Zuo, Yuanyuan
    Hu, Zhiqun
    Yuan, Shujie
    Zheng, Jiafeng
    Yin, Xiaoyan
    Li, Boyong
    Advances in Atmospheric Sciences, 2022, 39 (12) : 2203 - 2212
  • [40] Crowdsourcing Logistics Pricing Optimization Model Based on DBSCAN Clustering Algorithm
    Li, Zhichao
    Li, Yilin
    Lu, Wanchun
    Huang, Jilin
    IEEE ACCESS, 2020, 8 : 92615 - 92626