Top-k spatial joins of probabilistic objects

被引:24
|
作者
Ljosa, Vebjorn [1 ]
Singh, Ambuj K. [2 ]
机构
[1] MIT, Broad Inst, 7 Cambridge Ctr, Cambridge, MA 02142 USA
[2] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
D O I
10.1109/ICDE.2008.4497465
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Probabilistic data have recently become popular in applications such as scientific and geospatial databases. For images and other spatial datasets, probabilistic values can capture the uncertainty in extent and class of the objects in the images. Relating one such dataset to another by spatial joins is an important operation for data management systems. We consider probabilistic spatial join (PSJ) queries, which rank the results according to a score that incorporates both the uncertainties associated with the objects and the distances between them. We present algorithms for two kinds of PSJ queries: Threshold PSJ queries, which return all pairs that score above a given threshold, and top-k PSJ queries, which return the k top-scoring pairs. For threshold PSJ queries, we propose a plane sweep algorithm that, because it exploits the special structure of the problem, runs in O(n (log n + k)) time, where n is the number of points and k is the number of results. We extend the algorithms to 2-D data and to top-k PSJ queries. To further speed up top-k PSJ queries, we develop a scheduling technique that estimates the scores at the level of blocks, then hands the blocks to the plane sweep algorithm. By finding high-scoring pairs early, the scheduling allows a large portion of the datasets to be pruned. Experiments demonstrate speed-ups of two orders of magnitude.
引用
收藏
页码:566 / +
页数:2
相关论文
共 50 条
  • [1] Top-k spatial joins
    Zhu, ML
    Papadias, D
    Zhang, J
    Lee, DL
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (04) : 567 - 579
  • [2] Top-k spatial distance joins
    Qi, Shuyao
    Bouros, Panagiotis
    Mamoulis, Nikos
    [J]. GEOINFORMATICA, 2020, 24 (03) : 591 - 631
  • [3] Top-k spatial distance joins
    Shuyao Qi
    Panagiotis Bouros
    Nikos Mamoulis
    [J]. GeoInformatica, 2020, 24 : 591 - 631
  • [4] Top-k String Similarity Joins
    Qi, Shuyao
    Bouros, Panagiotis
    Mamoulis, Nikos
    [J]. PROCEEDINGS OF THE 32TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2020, 2020,
  • [5] Top-k Set Similarity Joins
    Xiao, Chuan
    Wang, Wei
    Lin, Xuemin
    Shang, Haichuan
    [J]. ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 916 - +
  • [6] Efficient Processing of Top-k Joins in MapReduce
    Saouk, Mei
    Doulkeridis, Christos
    Vlachou, Akrivi
    Norvag, Kjetil
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 570 - 577
  • [7] Continuous top-k spatial–keyword search on dynamic objects
    Yuyang Dong
    Chuan Xiao
    Hanxiong Chen
    Jeffrey Xu Yu
    Kunihiro Takeoka
    Masafumi Oyamada
    Hiroyuki Kitagawa
    [J]. The VLDB Journal, 2021, 30 : 141 - 161
  • [8] Finding top-k relevant groups of spatial web objects
    Anders Skovsgaard
    Christian S. Jensen
    [J]. The VLDB Journal, 2015, 24 : 537 - 555
  • [9] Finding top-k relevant groups of spatial web objects
    Skovsgaard, Anders
    Jensen, Christian S.
    [J]. VLDB JOURNAL, 2015, 24 (04): : 537 - 555
  • [10] Distributed Evaluation of Top-k Temporal Joins
    Pilourdault, Julien
    Leroy, Vincent
    Amer-Yahia, Sihem
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1027 - 1039