STRP-DBSCAN: A Parallel DBSCAN Algorithm Based on Spatial-Temporal Random Partitioning for Clustering Trajectory Data

被引:2
|
作者
An, Xiaoya [1 ]
Wang, Ziming [2 ]
Wang, Ding [2 ]
Liu, Song [2 ]
Jin, Cheng [1 ]
Xu, Xinpeng [2 ,3 ]
Cao, Jianjun [3 ]
机构
[1] Xian Res Inst Surveying & Mapping, State Key Lab Geoinformat Engn, Xian 710054, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[3] East China Res Inst Comp Technol, Shanghai 201818, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 20期
关键词
parallel DBSCAN algorithm; clustering parameters autotuning; deep reinforcement learning; spatial-temporal random partitioning; trajectory data clustering;
D O I
10.3390/app132011122
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Trajectory clustering algorithms analyze the movement trajectory of the target objects to mine the potential movement trend, regularity, and behavioral patterns of the object. Therefore, the trajectory clustering algorithm has a wide range of applications in the fields of traffic flow analysis, logistics and transportation management, and crime analysis. Existing algorithms do not make good use of the temporal attributes of trajectory data, resulting in a long clustering time and low clustering accuracy of spatial-temporal trajectory data. Meanwhile, the density-based clustering algorithms represented by DBSCAN are very sensitive to the clustering parameters. The radius value Eps and the minimal points number MinPts within Eps radius, defined by the user, have a significant impact on the clustering results, and tuning these parameters is difficult. In this paper, we present STRP-DBSCAN, a parallel DBSCAN algorithm based on spatial-temporal random partitioning for clustering trajectory data. It adopts spatial-temporal random partitioning to distribute balanced computation among different computing nodes and reduce the communication overhead of the parallel clustering algorithm, thus improving the execution efficiency of the DBSCAN algorithm. We also present the PER-SAC algorithm, which uses deep reinforcement learning to combine the prioritized experience replay (PER) and the soft actor-critic (SAC) algorithm for autotuning the optimal parameters of DBSCAN. The experimental results show that STRP-DBSCAN effectively reduces the clustering time of spatial-temporal trajectory data by up to 96.2% and 31.2% compared to parallel DBSCAN and the state-of-the-art RP-DBSCAN. The PER-SAC algorithm also outperforms the state-of-the-art DBSCAN parameter tuning algorithms and improves the clustering accuracy by up to 8.8%. At the same time, the proposed algorithm obtains a higher stability of clustering accuracy.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] RP-DBSCAN: A Superfast Parallel DBSCAN Algorithm Based on Random Partitioning
    Song, Hwanjun
    Lee, Jae-Gil
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1173 - 1187
  • [2] Research on Parallel Design of DBSCAN Clustering Algorithm in Spatial Data Mining
    Zhou, Gong-jian
    2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 343 - 348
  • [3] Data-partitioning-based DBSCAN algorithm
    Zhou, Shuigeng
    Zhou, Aoying
    Cao, Jing
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (10): : 1153 - 1159
  • [4] Trajectory outlier detection based on DBSCAN clustering algorithm
    Zhou P.
    Ding Q.
    Luo H.
    Hou X.
    1600, Chinese Society of Astronautics (46):
  • [5] Research of parallel DBSCAN clustering algorithm based on MapReduce
    Fu, X. (xffu@gdut.edu.cn), 1600, Science and Engineering Research Support Society (07):
  • [6] An Adaptive Hierarchical Clustering Method for Ship Trajectory Data Based on DBSCAN Algorithm
    Zhao, Liangbin
    Shi, Guoyou
    Yang, Jiaxuan
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 334 - 341
  • [7] Parallel DBSCAN Algorithm Using a Data Partitioning Strategy with Spark Implementation
    Han, Dianwei
    Agrawal, Ankit
    Liao, Wei-keng
    Choudhary, Alok
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 305 - 312
  • [8] Research on the Parallelization of the DBSCAN Clustering Algorithm for Spatial Data Mining Based on the Spark Platform
    Huang, Fang
    Zhu, Qiang
    Zhou, Ji
    Tao, Jian
    Zhou, Xiaocheng
    Jin, Du
    Tan, Xicheng
    Wang, Lizhe
    REMOTE SENSING, 2017, 9 (12)
  • [9] An SNN-DBSCAN Based Clustering Algorithm for Big Data
    Pandey, Sriniwas
    Samal, Mamata
    Mohanty, Sraban Kumar
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 127 - 137
  • [10] Laser Radar Data Registration Algorithm Based on DBSCAN Clustering
    Liu, Yiting
    Zhang, Lei
    Li, Peijuan
    Jia, Tong
    Du, Junfeng
    Liu, Yawen
    Li, Rui
    Yang, Shutao
    Tong, Jinwu
    Yu, Hanqi
    ELECTRONICS, 2023, 12 (06)