A method for efficient clustering of spatial data in network space

被引:7
|
作者
Nguyen, Trang T. D. [1 ]
Nguyen, Loan T. T. [2 ,3 ]
Anh Nguyen [4 ]
Yun, Unil [5 ]
Bay Vo [6 ]
机构
[1] Nha Trang Univ, Fac Informat Technol, Nha Trang, Vietnam
[2] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[4] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland
[5] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[6] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Spatial data mining; spatial data clustering; NS-DBSCAN; network spatial analysis; FAST SEARCH; ALGORITHM; DBSCAN; FIND;
D O I
10.3233/JIFS-202806
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial clustering is one of the main techniques for spatial data mining and spatial data analysis. However, existing spatial clustering methods primarily focus on points distributed in planar space with the Euclidean distance measurement. Recently, NS-DBSCAN has been developed to perform clustering of spatial point events in Network Space based on a well-known clustering algorithm, named Density-Based Spatial Clustering of Applications with Noise (DBSCAN). The NS-DBSCAN algorithm has efficiently solved the problem of clustering network constrained spatial points. When compared to the NC_DT (Network-Constraint Delaunay Triangulation) clustering algorithm, the NS-DBSCAN algorithm efficiently solves the problem of clustering network constrained spatial points by visualizing the intrinsic clustering structure of spatial data by constructing density ordering charts. However, the main drawback of this algorithm is when the data are processed, objects that are not specifically categorized into types of clusters cannot be removed, which is undeniably a waste of time, particularly when the dataset is large. In an attempt to have this algorithm work with great efficiency, we thus recommend removing edges that are longer than the threshold and eliminating low-density points from the density ordering table when forming clusters and also take other effective techniques into consideration. In this paper, we develop a theorem to determine the maximum length of an edge in a road segment. Based on this theorem, an algorithm is proposed to greatly improve the performance of the density-based clustering algorithm in network space (NS-DBSCAN). Experiments using our proposed algorithm carried out in collaboration with Ho Chi Minh City, Vietnam yield the same results but shows an advantage of it over NS-DBSCAN in execution time.
引用
收藏
页码:11653 / 11670
页数:18
相关论文
共 50 条
  • [21] Scale space based on clustering method integrating spatial relationships and non-spatial attributes
    Huang Zhimin
    Zhang Xiaodong
    Zhang Feng
    Ke Weiliang
    GEOINFORMATICS 2006: GEOSPATIAL INFORMATION SCIENCE, 2006, 6420
  • [22] Spatial clustering of events on a network
    Steenberghen, Therese
    Aerts, Koen
    Thomas, Isabelle
    JOURNAL OF TRANSPORT GEOGRAPHY, 2010, 18 (03) : 411 - 418
  • [23] clusterMLD: An Efficient Hierarchical Clustering Method for Multivariate Longitudinal Data
    Zhou, Junyi
    Zhang, Ying
    Tu, Wanzhu
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (03) : 1131 - 1144
  • [24] Data Clustering Method Using Efficient Fuzzifier Values Derivation
    Cho, Jaehyuk
    Joo, Wonhee
    IEEE ACCESS, 2020, 8 : 124624 - 124632
  • [25] An efficient clustering method for high-dimensional data mining
    Chang, JW
    Kim, YK
    ADVANCES IN ARTIFICIAL INTELLIGENCE - SBIA 2004, 2004, 3171 : 276 - 285
  • [26] DAIS: a method for identifying spatial domains based on density clustering of spatial omics data
    Yu, Qichao
    Tian, Ru
    Jin, Xin
    Wu, Liang
    JOURNAL OF GENETICS AND GENOMICS, 2024, 51 (08) : 884 - 887
  • [27] DAIS: a method for identifying spatial domains based on density clustering of spatial omics data
    Qichao Yu
    Ru Tian
    Xin Jin
    Liang Wu
    Journal of Genetics and Genomics, 2024, 51 (08) : 884 - 887
  • [28] Energy Efficient Clustering Algorithm for Data Aggregation in Wireless sensor network
    Ahir, Binkal S.
    Parmar, Rohan
    Kadhiwala, Bintu
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 683 - 688
  • [29] An efficient clustering scheme to exploit hierarchical data in network traffic analysis
    Mahmood, Abdun Naser
    Leckie, Christopher
    Udaya, Parampalli
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (06) : 752 - 767
  • [30] An Evolutionary Immune Network based on Kernel method for data clustering
    Wu, Lei
    Peng, Lei
    Ye, Ya-Lan
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1759 - +