An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory

被引:60
|
作者
Schuh, Stefan [1 ]
Chen, Xiao [1 ]
Dittrich, Jens [1 ]
机构
[1] Saarland Univ, Informat Syst Grp, Saarbrucken, Germany
关键词
MULTI-CORE;
D O I
10.1145/2882903.2882917
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relational equi-joins are at the heart of almost every query plan. They have been studied, improved, and reexamined on a regular basis since the existence of the database community. In the past four years several new join algorithms have been proposed and experimentally evaluated. Some of those papers contradict each other in their experimental findings. This makes it surprisingly hard to answer a very simple question: what is the fastest join algorithm in 2015? In this paper we will try to develop an answer. We start with an end-to-end black box comparison of the most important methods. Afterwards, we inspect the internals of these algorithms in a white box comparison. We derive improved variants of state-of-the-art join algorithms by applying optimizations like software write combine buffers, various hash table implementations, as well as NUMA-awareness in terms of data placement and scheduling. We also inspect various radix partitioning strategies. Eventually, we are in the position to perform a comprehensive comparison of thirteen different join algorithms. We factor in scaling effects in terms of size of the input datasets, the number of threads, different page sizes, and data distributions. Furthermore, we analyze the impact of various joins on an (unchanged) TPC-H query. Finally, we conclude with a list of major lessons learned from our study and a guideline for practitioners implementing massive main-memory joins. As is the case with almost all algorithms in databases, we will learn that there is no single best join algorithm. Each algorithm has its strength and weaknesses and shines in different areas of the parameter space.
引用
收藏
页码:1961 / 1976
页数:16
相关论文
共 50 条
  • [1] An Experimental Study on Federated Equi-Joins
    Li, Shuyuan
    Zeng, Yuxiang
    Wang, Yuxiang
    Zhong, Yiman
    Zhou, Zimu
    Tong, Yongxin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (09) : 4443 - 4457
  • [2] Scaling Equi-Joins
    Metwally, Ahmed
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2163 - 2176
  • [3] Beyond Equi-joins: Ranking, Enumeration and Factorization
    Tziavelis, Nikolaos
    Gatterbauer, Wolfgang
    Riedewald, Mirek
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2599 - 2612
  • [4] Equi-Joins over Encrypted Data for Series of Queries
    Shafieinejad, Masoumeh
    Gupta, Suraj
    Liu, Jin Yang
    Karabina, Koray
    Kerschbaum, Florian
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 1635 - 1648
  • [5] What Is the Price for Joining Securely? Benchmarking Equi-Joins in Trusted Execution Environments
    Maliszewski, Kajetan
    Quiane-Ruiz, Jorge-Arnulfo
    Traub, Jonas
    Markl, Volker
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (03): : 659 - 672
  • [6] An Experimental Analysis of Iterated Spatial Joins in Main Memory
    Sowell, Benjamin
    Salles, Marcos Vaz
    Cao, Tuan
    Demers, Alan
    Gehrke, Johannes
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1882 - 1893
  • [7] Spatial Joins in Main Memory: Implementation Matters!
    Sidlauskas, Darius
    Jensen, Christian S.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (01): : 97 - 100
  • [8] Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach
    Jha, Saurabh
    He, Bingsheng
    Lu, Mian
    Cheng, Xuntao
    Huynh Phung Huynh
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (06): : 642 - 653
  • [9] Configuring Spatial Grids for Efficient Main Memory Joins
    Tauheed, Farhan
    Heinis, Thomas
    Ailamaki, Anastasia
    [J]. DATA SCIENCE, 2015, 9147 : 199 - 205
  • [10] MQJoin: Efficient Shared Execution of Main-Memory Joins
    Makreshanski, Darko
    Giannikis, Georgios
    Alonso, Gustavo
    Kossmann, Donald
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (06): : 480 - 491