Worst-case Optimal Join Algorithms

被引:71
|
作者
Ngo, Hung Q. [1 ,4 ]
Porat, Ely [2 ]
Re, Christopher [3 ,5 ]
Rudra, Atri [1 ,4 ]
机构
[1] Univ Buffalo SUNY, Buffalo, NY 14260 USA
[2] Bar Ilan Univ, IL-5290002 Ramat Gan, Israel
[3] Stanford Univ, Stanford, CA 94305 USA
[4] Univ Buffalo, 338 Davis Hall, Buffalo, NY 14214 USA
[5] Gates Comp Sci Bldg,353 Serra Mall, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Join Algorithms; fractional cover bound; Loomis-Whitney inequality; Bollobas-Thomason inequality; QUERY EVALUATION; NUMBER; BOUNDS; SETS;
D O I
10.1145/3180143
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient join processing is one of the most fundamental and well-studied tasks in database research. In this work, we examine algorithms for natural join queries over many relations and describe a new algorithm to process these queries optimally in terms of worst-case data complexity. Our result builds on recent work by Atserias, Grohe, and Marx, who gave bounds on the size of a natural join query in terms of the sizes of the individual relations in the body of the query. These bounds, however, are not constructive: they rely on Shearer's entropy inequality, which is information-theoretic. Thus, the previous results leave open the question of whether there exist algorithms whose runtimes achieve these optimal bounds. An answer to this question may be interesting to database practice, as we show in this article that any project-join style plans, such as ones typically employed in a relational database management system, are asymptotically slower than the optimal for some queries. We present an algorithm whose runtime is worst-case optimal for all natural join queries. Our result may be of independent interest, as our algorithm also yields a constructive proof of the general fractional cover bound by Atserias, Grohe, and Marx without using Shearer's inequality. This bound implies two famous inequalities in geometry: the Loomis-Whitney inequality and its generalization, the Bollobas-Thomason inequality. Hence, our results algorithmically prove these inequalities as well. Finally, we discuss how our algorithm can be used to evaluate full conjunctive queries optimally, to compute a relaxed notion of joins and to optimally (in the worst-case) enumerate all induced copies of a fixed subgraph inside of a given large graph.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Worst-Case Optimal Join Algorithms: Techniques, Results, and Open Problems
    Ngo, Hung Q.
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018, : 111 - 124
  • [2] A Worst-Case Optimal Join Algorithm for SPARQL
    Hogan, Aidan
    Riveros, Cristian
    Rojas, Carlos
    Soto, Adrian
    SEMANTIC WEB - ISWC 2019, PT I, 2019, 11778 : 258 - 275
  • [3] ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Join Algorithms via Reinforcement Learning
    Wang, Junxiong
    Trummer, Immanuel
    Kara, Ahmet
    Olteanu, Dan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 2805 - 2817
  • [4] On the limitations of worst-case optimal ray shooting algorithms
    Szirmay-Kalos, L
    Marton, G
    WSCG '97: THE FIFTH INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS AND VISUALIZATION '97, CONFERENCE PROCEEDINGS, VOL 1-4, 1997, : 562 - 571
  • [5] WORST-CASE ANALYSIS OF ALGORITHMS
    VANTRIGT, C
    PHILIPS JOURNAL OF RESEARCH, 1978, 33 (1-2) : 66 - 77
  • [6] Analysis and construction of worst-case optimal ray shooting algorithms
    Szirmay-Kalos, Laszlo
    Marton, Gabor
    Computers and Graphics (Pergamon), 1998, 22 (2-3): : 167 - 174
  • [7] Analysis and construction of worst-case optimal ray shooting algorithms
    Szirmay-Kalos, L
    Marton, G
    COMPUTERS & GRAPHICS, 1998, 22 (2-3) : 167 - 174
  • [8] Querying APIs with SPARQL: Language and Worst-Case Optimal Algorithms
    Mosser, Matthieu
    Pieressa, Fernando
    Reutter, Juan
    Soto, Adrian
    Vrgoc, Domagoj
    SEMANTIC WEB (ESWC 2018), 2018, 10843 : 639 - 654
  • [9] OPTIMAL HISTOGRAMS FOR LIMITING WORST-CASE ERROR PROPAGATION IN THE SIZE OF JOIN RESULTS
    IOANNIDIS, YE
    CHRISTODOULAKIS, S
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 1993, 18 (04): : 709 - 748
  • [10] Split Packing: Algorithms for Packing Circles with Optimal Worst-Case Density
    Sándor P. Fekete
    Sebastian Morr
    Christian Scheffer
    Discrete & Computational Geometry, 2019, 61 : 562 - 594