Robust Join Processing with Diamond Hardened Joins

被引:0
|
作者
Birler, Altan [1 ]
Kemper, Alfons [1 ]
Neumann, Thomas [1 ]
机构
[1] Tech Univ Munich, Munich, Germany
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2024年 / 17卷 / 11期
关键词
QUERY PLANS; LOOKING;
D O I
10.14778/3681954.3681995
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Join ordering and join processing has a huge impact on query execution and can easily affect the query response time by orders of magnitude. In particular, when joins are potentially growing n:m joins, execution can be very expensive. This can be seen by examining the sizes of intermediate results: If a join query produces many redundant tuples that are later eliminated, the query is likely expensive, which is not justified by the query result. This gives the query a diamond shape, with intermediate results larger than the inputs and the output. This occurs frequently in various workloads, particularly, in graph workloads, and also in benchmarks like JOB. We call this issue the diamond problem, and to address it, we propose the diamond hardened join framework, which splits join operators into two suboperators: Lookup & Expand. By allowing these suboperators to be freely reordered by the query optimizer, we improve the runtime of queries that exhibit the diamond problem without sacrificing performance for the rest of the queries. Past theoretical work such as worst-case optimal joins similarly try to avoid huge intermediate results. However, these approaches have significant overheads that impact all queries. We demonstrate that our approach leads to excellent performance both in queries that exhibit the diamond problem and in regular queries that can be handled by traditional binary joins. This allows for a unified approach, offering excellent performance across the board. Compared to traditional joins, queries' performance is improved by up to 500x in the CE benchmark and remains excellent in TPC-H and JOB.
引用
收藏
页码:3215 / 3228
页数:14
相关论文
共 50 条
  • [11] Track Join: Distributed Joins with Minimal Network Traffic
    Polychroniou, Orestis
    Sen, Rajkumar
    Ross, Kenneth A.
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1483 - 1494
  • [12] Plug&Join: An easy-to-use generic algorithm for efficiently processing equi and non-equi joins
    van den Bercken, J
    Schneider, M
    Seeger, B
    ADVANCES IN DATABASE TECHNOLOGY-DEBT 2000, PROCEEDINGS, 2000, 1777 : 495 - 509
  • [13] Joins vs. Links or Relational Join Considered Harmful
    Savinov, Alexandr
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 362 - 368
  • [14] Selective Wander Join: Fast Progressive Visualizations for Data Joins
    Procopio, Marianne
    Scheidegger, Carlos
    Wu, Eugene
    Chang, Remco
    INFORMATICS-BASEL, 2019, 6 (01):
  • [15] Random access with a distributed Bitmap Join Index for Star Joins
    Brito, Jaqueline J.
    Mosqueiro, Thiago
    Ciferri, Ricardo R.
    Ciferri, Cristina D. A.
    HELIYON, 2020, 6 (02)
  • [16] Improvement of Join Algorithms for Low-Selectivity Joins on MapReduce
    Matono, Akiyoshi
    Ogawa, Hirotaka
    Kojima, Isao
    DATABASES THEORY AND APPLICATIONS, 2015, 9093 : 117 - 128
  • [17] Efficient processing of spatiotemporal joins
    Zimbrao, G
    De Souza, JM
    De Almeida, VT
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 190 - 195
  • [18] PLAN TO JOIN DIAMOND DRILLERS IN VANCOUVER
    SCALES, M
    CANADIAN MINING JOURNAL, 1983, 104 (03) : 28 - 28
  • [19] PROCESSING HARDENED STEEL
    BORRUSO, M
    PLATING AND SURFACE FINISHING, 1995, 82 (02): : 38 - 38
  • [20] DIAMOND HONING OF HARDENED CAST IRON
    NEKRASOV, SS
    BOICHUK, IF
    RUSSIAN ENGINEERING JOURNAL-USSR, 1968, 48 (04): : 79 - &