A method for efficient clustering of spatial data in network space

被引:7
|
作者
Nguyen, Trang T. D. [1 ]
Nguyen, Loan T. T. [2 ,3 ]
Anh Nguyen [4 ]
Yun, Unil [5 ]
Bay Vo [6 ]
机构
[1] Nha Trang Univ, Fac Informat Technol, Nha Trang, Vietnam
[2] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[4] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland
[5] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[6] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Spatial data mining; spatial data clustering; NS-DBSCAN; network spatial analysis; FAST SEARCH; ALGORITHM; DBSCAN; FIND;
D O I
10.3233/JIFS-202806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial clustering is one of the main techniques for spatial data mining and spatial data analysis. However, existing spatial clustering methods primarily focus on points distributed in planar space with the Euclidean distance measurement. Recently, NS-DBSCAN has been developed to perform clustering of spatial point events in Network Space based on a well-known clustering algorithm, named Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The NS-DBSCAN algorithm has efficiently solved the problem of clustering network constrained spatial points. When compared to the NC_DT (Network-Constraint Delaunay Triangulation) clustering algorithm, the NS-DBSCAN algorithm efficiently solves the problem of clustering network constrained spatial points by visualizing the intrinsic clustering structure of spatial data by constructing density ordering charts. However, the main drawback of this algorithm is when the data are processed, objects that are not specifically categorized into types of clusters cannot be removed, which is undeniably a waste of time, particularly when the dataset is large. In an attempt to have this algorithm work with great efficiency, we thus recommend removing edges that are longer than the threshold and eliminating low-density points from the density ordering table when forming clusters and also take other effective techniques into consideration. In this paper, we develop a theorem to determine the maximum length of an edge in a road segment. Based on this theorem, an algorithm is proposed to greatly improve the performance of the density-based clustering algorithm in network space (NS-DBSCAN). Experiments using our proposed algorithm carried out in collaboration with Ho Chi Minh City, Vietnam yield the same results but shows an advantage of it over NS-DBSCAN in execution time.
引用
收藏
页码:11653 / 11670
页数:18
相关论文
共 50 条
  • [1] An efficient topological-based clustering method on spatial data in network space
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Yun, Unil
    Vo, Bay
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [2] Efficient density clustering method for spatial data
    Pan, F
    Wang, BY
    Zhang, Y
    Ren, DM
    Hu, X
    Perrizo, W
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 : 375 - 386
  • [3] NS-IDBSCAN: An efficient incremental clustering method for geospatial data in network space
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Duy, Le Nhat
    Vo, Bay
    INFORMATION SCIENCES, 2025, 690
  • [4] A Novel Spatial Clustering Method based on Wavelet Network and Density Analysis for Data Stream
    Xu, Chonghuan
    JOURNAL OF COMPUTERS, 2013, 8 (08) : 2139 - 2143
  • [5] An Efficient Clustering Method for Medical Data Applications
    Li, Shuai
    Zhou, Xiaofeng
    Shi, Haibo
    Zheng, Zeyu
    2015 IEEE INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2015, : 133 - 138
  • [6] Efficient strategies for spatial data clustering using topological relations
    Nguyen, Trang T. D.
    Nguyen, Loan T. T.
    Bui, Quang-Thinh
    Duy, Le Nhat
    Pedrycz, Witold
    Vo, Bay
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [7] An efficient space division–based width optimization method for RBF network using fuzzy clustering algorithms
    Yunwei Zhang
    Chunlin Gong
    Hai Fang
    Hua Su
    Chunna Li
    Andrea Da Ronch
    Structural and Multidisciplinary Optimization, 2019, 60 : 461 - 480
  • [8] CLARANS: A method for clustering objects for spatial data mining
    Ng, RT
    Han, JW
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (05) : 1003 - 1016
  • [9] Urban Green Space Assessment: Spatial Clustering Method Based on Multisource Data to Facilitate Zoning Planning
    Wu, Chao
    Yang, Shuo
    Ma, Yibin
    Liu, Pengyu
    Ye, Xinyue
    JOURNAL OF URBAN PLANNING AND DEVELOPMENT, 2024, 150 (04)
  • [10] Echidna: Efficient clustering of hierarchical data for network traffic analysis
    Mahmood, Abdun Naser
    Leckie, Christopher
    Udaya, Parampalli
    NETWORKING 2006: NETWORKING TECHNOLOGIES, SERVICES, AND PROTOCOLS; PERFORMANCE OF COMPUTER AND COMMUNICATION NETWORKS; MOBILE AND WIRELESS COMMUNICATIONS SYSTEMS, 2006, 3976 : 1092 - 1098