Using slice join for efficient evaluation of multi-way joins

被引:5
|
作者
Lawrence, Ramon [1 ]
机构
[1] Univ British Columbia Okanagan, Kelowna, BC V1V 1V7, Canada
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
early join algorithm; reading policy; interactive querying; adaptive; hashing;
D O I
10.1016/j.datak.2008.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A standard hash join algorithm joins two relations at a time and requires reading the entire smaller input before results are generated. There has been recent focus on constructing join algorithms that produce results faster and can join more than two relations simultaneously. Early joins that are capable of producing results before reading the smaller relation are useful for network joins where the input arrival rates may vary as the operator can adapt without explicit query re-optimization. Multi-way joins improve performance by reducing the number of intermediate results generated and are more resilient to poor estimates by the query optimizer. The only join algorithm that combines the two features of multi-way support and early result production is limited to processing joins where all inputs are joined on the same attribute. In this work, we propose a new hash-based join algorithm called slice join. Slice join is an early, multi-way join algorithm capable of joining relations on common attributes and relations connected by a sequence of functional dependencies. Slice join is useful for a larger number of query plans, performs fewer disk operations, and has a simpler duplicate detection technique than previous approaches. Experimental results demonstrate that slice join outperforms other multi-way join operators and binary join plans. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:118 / 139
页数:22
相关论文
共 50 条
  • [31] Native Execution of GraphQL Queries over RDF Graphs Using Multi-Way Joins
    Karalis, Nikolaos
    Bigerl, Alexander
    Ngonga Ngomo, Axel-Cyrille
    KNOWLEDGE GRAPHS: SEMANTICS, MACHINE LEARNING, AND LANGUAGES, 2023, 56 : 77 - 93
  • [32] Multi-way Time Series Join on Multi-length Patterns
    Mollah, Md Parvez
    Souza, Vinicius M. A.
    Mueen, Abdullah
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 429 - 438
  • [33] Generalized communication cost efficient multi-way spatial join: revisiting the curse of the last reducer
    Bhattu, S. Nagesh
    Potluri, Avinash
    Kadari, Prashanth
    Subramanyam, R. B. V.
    GEOINFORMATICA, 2020, 24 (03) : 557 - 589
  • [34] AutoMJ: Towards Efficient Multi-way Join Query on Distributed Data-parallel Platform
    Zhu, Guanghui
    Wu, Xiaoqi
    Gu, Rong
    Yuan, Chunfeng
    Huang, Yihua
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 161 - 169
  • [35] Generalized communication cost efficient multi-way spatial join: revisiting the curse of the last reducer
    S. Nagesh Bhattu
    Avinash Potluri
    Prashanth Kadari
    Subramanyam R. B. V.
    GeoInformatica, 2020, 24 : 557 - 589
  • [36] Towards Multi-way Join Aware Optimizer in SAP HANA
    Wi, Sungheun
    Han, Wook-Shin
    Chang, Chuho
    Kim, Kihong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (12): : 3019 - 3031
  • [37] Towards Multi-way Join Aware Optimizer in SAP HANA
    Wi, Sungheun
    Han, Wook-Shin
    Chang, Chuho
    Kim, Kihong
    Proceedings of the VLDB Endowment, 2020, 13 (12): : 3019 - 3031
  • [38] 基于并行的Multi-Way Join查询优化
    孙晓川
    杨磊
    山东师大学报(自然科学版) , 1998, (04) : 89 - 92
  • [39] Distributed Spatial Join Processing for Multiple Spatial Datasets - Multi-way Spatial Join
    Cunha, Anderson R.
    de Oliveira, Savio S. T.
    de Oliveira, Thiago B.
    Aleixo, Everton L.
    Cardoso, Marcelo de C.
    do Sacramento Rodrigues, Vagner J.
    2015 XXXIII BRAZILIAN SYMPOSIUM ON COMPUTER NETWORKS AND DISTRIBUTED SYSTEMS, 2015, : 171 - 181
  • [40] Adaptive Optimisation For Continuous Multi-Way Joins Over RDF Streams
    Danh Le-Phuoc
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1857 - 1865