Classic distance join queries using compact data structures

被引:0
|
作者
de Bernardo G. [1 ]
Penabad M.R. [1 ]
Corral A. [2 ]
Brisaboa N.R. [1 ]
机构
[1] Universidade da Coruña, Centro de investigación CITIC, A Coruña
[2] Dept. of Informatics, University of Almeria, Almeria
关键词
K closest pairs; k[!sup]2[!/sup]-tree; Spatial query evaluation; ε distance join;
D O I
10.1016/j.ins.2024.120732
中图分类号
学科分类号
摘要
Distance-based Join Queries (DJQs) have multiple applications in spatial databases, Geographic Information Systems, and other areas. The K Closest Pairs Query (KCPQ) and the ε Distance Join Query (εDJQ) are well-known DJQs that have been widely studied and can be solved using plane-sweep techniques, which are efficient but must keep the whole datasets in main memory. In this work, we propose DJQ algorithms that work with data represented using a k2-tree, a compact data structure for binary grids. Our algorithms solve KCPQ and εDJQ queries, as well as several window-constrained variants, taking advantage of the indexing capabilities of k2-trees to efficiently answer queries without the need to decompress the data. Our experimental evaluation with large datasets shows that k2-tree algorithms are up to 5 times faster than plane-sweep algorithms in KCPQ, and 5–30 times faster in εDJQ. In variants that are window-constrained, our algorithms are competitive in most scenarios and faster for large windows. Additionally, our algorithms are not very affected by the distribution of the data and yield much more predictable query times, showing up to 30 times smaller variance in query times than plane sweep, depending on the location of the query window. © 2024 The Author(s)
引用
收藏
相关论文
共 50 条
  • [21] Join Queries on Uncertain Data: Semantics and Efficient Processing
    Ge, Tingjian
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 697 - 708
  • [22] Approximate Distance Queries and Compact Routing in Sparse Graphs
    Agarwal, Rachit
    Godfrey, P. Brighten
    Har-Peled, Sariel
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 1754 - 1762
  • [23] A Method for Automatically Generating Join Queries Based on Relations-Attributes Distance Matrix over Data Lakes
    Zhang, Caicai
    Lu, Chenglang
    Mei, Zhuolin
    Wu, Bin
    Yu, Jing
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (05): : 1539 - 1546
  • [24] Estimating aggregate join queries over data streams using discrete cosine transform
    Jiang, Zhewei
    Luo, Cheng
    Hou, Wen-Chi
    Yan, Feng
    Zhu, Qiang
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 182 - 192
  • [25] Efficient of bitmap join indexes for optimising star join queries in relational data warehouses
    Yahyaoui, Mohammed
    Amjad, Souad
    Benameur, Lamia
    Jellouli, Ismail
    International Journal of Computational Intelligence Studies, 2020, 9 (03) : 220 - 233
  • [26] VA-files vs. R*-trees in distance join queries
    Corral, A
    D'Ermiliis, A
    Manolopoulos, Y
    Vassilakopoulos, M
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3631 : 153 - 166
  • [27] Efficient large-scale distance-based join queries in spatialhadoop
    Garcia-Garcia, Francisco
    Corral, Antonio
    Iribarne, Luis
    Vassilakopoulos, Michael
    Manolopoulos, Yannis
    GEOINFORMATICA, 2018, 22 (02) : 171 - 209
  • [28] Efficient large-scale distance-based join queries in spatialhadoop
    Francisco García-García
    Antonio Corral
    Luis Iribarne
    Michael Vassilakopoulos
    Yannis Manolopoulos
    GeoInformatica, 2018, 22 : 171 - 209
  • [29] Data Structures for Path Queries
    He, Meng
    Munro, J. Ian
    Zhou, Gelin
    ACM TRANSACTIONS ON ALGORITHMS, 2016, 12 (04)
  • [30] Transformation of continuous aggregation join queries over data streams
    Tran, Tri Minh
    Lee, Byung Suk
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2007, 4605 : 330 - +