A method for efficient clustering of spatial data in network space

被引:7
|
作者
Nguyen, Trang T. D. [1 ]
Nguyen, Loan T. T. [2 ,3 ]
Anh Nguyen [4 ]
Yun, Unil [5 ]
Bay Vo [6 ]
机构
[1] Nha Trang Univ, Fac Informat Technol, Nha Trang, Vietnam
[2] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[4] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland
[5] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[6] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Spatial data mining; spatial data clustering; NS-DBSCAN; network spatial analysis; FAST SEARCH; ALGORITHM; DBSCAN; FIND;
D O I
10.3233/JIFS-202806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial clustering is one of the main techniques for spatial data mining and spatial data analysis. However, existing spatial clustering methods primarily focus on points distributed in planar space with the Euclidean distance measurement. Recently, NS-DBSCAN has been developed to perform clustering of spatial point events in Network Space based on a well-known clustering algorithm, named Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The NS-DBSCAN algorithm has efficiently solved the problem of clustering network constrained spatial points. When compared to the NC_DT (Network-Constraint Delaunay Triangulation) clustering algorithm, the NS-DBSCAN algorithm efficiently solves the problem of clustering network constrained spatial points by visualizing the intrinsic clustering structure of spatial data by constructing density ordering charts. However, the main drawback of this algorithm is when the data are processed, objects that are not specifically categorized into types of clusters cannot be removed, which is undeniably a waste of time, particularly when the dataset is large. In an attempt to have this algorithm work with great efficiency, we thus recommend removing edges that are longer than the threshold and eliminating low-density points from the density ordering table when forming clusters and also take other effective techniques into consideration. In this paper, we develop a theorem to determine the maximum length of an edge in a road segment. Based on this theorem, an algorithm is proposed to greatly improve the performance of the density-based clustering algorithm in network space (NS-DBSCAN). Experiments using our proposed algorithm carried out in collaboration with Ho Chi Minh City, Vietnam yield the same results but shows an advantage of it over NS-DBSCAN in execution time.
引用
收藏
页码:11653 / 11670
页数:18
相关论文
共 50 条
  • [32] Clustering spatial transcriptomics data
    Teng, Haotian
    Yuan, Ye
    Bar-Joseph, Ziv
    BIOINFORMATICS, 2022, 38 (04) : 997 - 1004
  • [33] Clustering Spatial Functional Data: A Method Based on a Nonparametric Variogram Estimation
    Romano, Elvira
    Verde, Rosanna
    Cozza, Valentina
    NEW PERSPECTIVES IN STATISTICAL MODELING AND DATA ANALYSIS, 2011, : 339 - 346
  • [34] Spatial Clustering of Functional Data
    Secchi, Piercesare
    Vantini, Simone
    Vitelli, Valeria
    RECENT ADVANCES IN FUNCTIONAL DATA ANALYSIS AND RELATED TOPICS, 2011, : 283 - 289
  • [35] Clustering data with spatial continuity
    Pham, TD
    10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001, : 69 - 72
  • [36] Method for data clustering in a high dimensional space based on a hypergraph model
    Zhang, Rong
    Peng, Hong
    Jisuanji Gongcheng/Computer Engineering, 2002, 28 (07):
  • [37] cBiK: A Space-Efficient Data Structure for Spatial Keyword Queries
    Sanjuan-Contreras, Carlos E.
    Gutierrez Retamal, Gilberto
    Martinez-Prieto, Miguel A.
    Seco, Diego
    IEEE ACCESS, 2020, 8 (08): : 98827 - 98846
  • [38] Data clustering in tolerance space
    Tzeng, CH
    Sun, FS
    ADVANCES IN INTELLIGENT DATA ANALYSIS V, 2003, 2810 : 297 - 306
  • [39] STiMR k-Means: An Efficient Clustering Method for Big Data
    Ben HajKacem, Mohamed Aymen
    Ben N'cir, Chiheb-Eddine
    Essoussi, Nadia
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (08)
  • [40] A clustering method for efficient segmentation of 3D laser data
    Klasing, Klaas
    Wollherr, Dirk
    Buss, Martin
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 4043 - 4048