Using slice join for efficient evaluation of multi-way joins

被引:5
|
作者
Lawrence, Ramon [1 ]
机构
[1] Univ British Columbia Okanagan, Kelowna, BC V1V 1V7, Canada
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
early join algorithm; reading policy; interactive querying; adaptive; hashing;
D O I
10.1016/j.datak.2008.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A standard hash join algorithm joins two relations at a time and requires reading the entire smaller input before results are generated. There has been recent focus on constructing join algorithms that produce results faster and can join more than two relations simultaneously. Early joins that are capable of producing results before reading the smaller relation are useful for network joins where the input arrival rates may vary as the operator can adapt without explicit query re-optimization. Multi-way joins improve performance by reducing the number of intermediate results generated and are more resilient to poor estimates by the query optimizer. The only join algorithm that combines the two features of multi-way support and early result production is limited to processing joins where all inputs are joined on the same attribute. In this work, we propose a new hash-based join algorithm called slice join. Slice join is an early, multi-way join algorithm capable of joining relations on common attributes and relations connected by a sequence of functional dependencies. Slice join is useful for a larger number of query plans, performs fewer disk operations, and has a simpler duplicate detection technique than previous approaches. Experimental results demonstrate that slice join outperforms other multi-way join operators and binary join plans. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:118 / 139
页数:22
相关论文
共 50 条
  • [1] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    DATA & KNOWLEDGE ENGINEERING, 1999, 29 (02) : 179 - 200
  • [2] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    DATA & KNOWLEDGE ENGINEERING, 1998, 28 (03) : 277 - 298
  • [3] Faster joins, self-joins and multi-way joins using join indices
    Lei, Hui
    Ross, Kenneth A.
    Data and Knowledge Engineering, 1999, 29 (02): : 179 - 200
  • [4] Efficient Evaluation of Conjunctive Regular Path Queries Using Multi-way Joins
    Karalis, Nikolaos
    Bigerl, Alexander
    Heidrich, Liss
    Sherif, Mohamed Ahmed
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB, PT I, ESWC 2024, 2024, 14664 : 218 - 235
  • [5] Efficient Multi-way Theta-Join Processing Using MapReduce
    Zhang, Xiaofei
    Chen, Lei
    Wang, Min
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11): : 1184 - 1195
  • [6] An Evaluation of Multi-way Joins for Relational Database Systems
    Henderson, Michael
    Lawrence, Ramon
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2013, 2014, 190 : 37 - 50
  • [7] Are Multi-way Joins Actually Useful?
    Henderson, Michael
    Lawrence, Ramon
    ICEIS: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2013, : 13 - 22
  • [8] Accelerating multi-way joins on the GPU
    Zhuohang Lai
    Xibo Sun
    Qiong Luo
    Xiaolong Xie
    The VLDB Journal, 2022, 31 : 529 - 553
  • [9] Accelerating multi-way joins on the GPU
    Lai, Zhuohang
    Sun, Xibo
    Luo, Qiong
    Xie, Xiaolong
    VLDB JOURNAL, 2022, 31 (03): : 529 - 553
  • [10] Multi-way spatial joins using R-trees: Methodology and performance evaluation
    Park, HH
    Cha, GH
    Chung, CW
    ADVANCES IN SPATIAL DATABASES, 1999, 1651 : 229 - 250