Fast joins using join indices

Cited by: 32
Authors
Li, Z [1 ]
Ross, KA [1 ]
Affiliation
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Source
VLDB JOURNAL | 1999 / Vol. 8 / Issue 01
Keywords
query processing; decision support systems
DOI
10.1007/s007780050071
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Two new algorithms, "Jive join" and "Slam join," are proposed for computing the join of two relations using a join index. The algorithms are duals: Jive join range-partitions input relation tuple ids and then processes each partition, while Slam join forms ordered runs of input relation tuple ids and then merges the results. Both algorithms make a single sequential pass through each input relation, in addition to one pass through the join index and two passes through a temporary file, whose size is half that of the join index. Both algorithms require only that the number of blocks in main memory is of the order of the square root of the number of blocks in the smaller relation. By storing intermediate and final join results in a vertically partitioned fashion, our algorithms need to manipulate less data in memory at a given time than other algorithms. The algorithms are resistant to data skew and adaptive to memory fluctuations. Selection conditions can be incorporated into the algorithms. Using a detailed cost model, the algorithms are analyzed and compared with competing algorithms. For large input relations, our algorithms perform significantly better than Valduriez's algorithm, the TID join algorithm, and hash join algorithms. An experimental study is also conducted to validate the analytical results and to demonstrate the performance characteristics of each algorithm in practice.
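The range-partitioning idea behind Jive join, as the abstract describes it, can be illustrated with a small in-memory sketch. This is not the authors' implementation: it assumes the join index is a list of (R tuple id, S tuple id) pairs sorted by R tuple id, that tuple ids are list positions, and that Python lists stand in for disk files. The function name `jive_join_sketch` and the parameter `boundaries` are hypothetical, and buffering, block I/O, and the cost model are ignored.

```python
from bisect import bisect_right


def jive_join_sketch(r_tuples, s_tuples, join_index, boundaries):
    """Illustrative sketch only (assumed data layout, not the paper's code).

    r_tuples, s_tuples: lists where the position is the tuple id.
    join_index: list of (r_id, s_id) pairs, assumed sorted by r_id.
    boundaries: sorted S tuple ids that split S into contiguous ranges.
    Returns the result vertically partitioned: for each partition p,
    r_out[p][i] joins with s_out[p][i].
    """
    n_parts = len(boundaries) + 1
    r_out = [[] for _ in range(n_parts)]  # R half of the result, per partition
    temp = [[] for _ in range(n_parts)]   # S tuple ids only (the temporary file)

    # Phase 1: one pass over the join index; because it is sorted by r_id,
    # R is read in ascending id order. Each entry is routed to the
    # partition whose range contains its S tuple id.
    for r_id, s_id in join_index:
        p = bisect_right(boundaries, s_id)
        r_out[p].append(r_tuples[r_id])
        temp[p].append(s_id)

    # Phase 2: handle partitions in range order. Within a partition, the
    # needed S tuples are fetched in ascending id order, so S as a whole
    # is read in one forward pass. The S half is then emitted in the same
    # order as the R half written in phase 1.
    s_out = []
    for ids in temp:
        fetched = {s_id: s_tuples[s_id] for s_id in sorted(set(ids))}
        s_out.append([fetched[s_id] for s_id in ids])

    return r_out, s_out
```

In this sketch the temporary lists hold only S tuple ids, one of the two columns of the join index, which mirrors the abstract's statement that the temporary file is half the size of the join index.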
Pages: 1-24
Page count: 24
Related papers
50 in total
  • [1] Fast joins using join indices
    Zhe Li
    Kenneth A. Ross
    [J]. The VLDB Journal, 1999, 8 : 1 - 24
  • [2] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    [J]. DATA & KNOWLEDGE ENGINEERING, 1999, 29 (02) : 179 - 200
  • [3] Faster joins, self-joins and multi-way joins using join indices
    Lei, H
    Ross, KA
    [J]. DATA & KNOWLEDGE ENGINEERING, 1998, 28 (03) : 277 - 298
  • [4] Faster joins, self-joins and multi-way joins using join indices
    Lei, Hui
    Ross, Kenneth A.
    [J]. Data and Knowledge Engineering, 1999, 29 (02) : 179 - 200
  • [5] Selective Wander Join: Fast Progressive Visualizations for Data Joins
    Procopio, Marianne
    Scheidegger, Carlos
    Wu, Eugene
    Chang, Remco
    [J]. INFORMATICS-BASEL, 2019, 6 (01):
  • [6] Efficient temporal join processing using indices
    Zhang, DH
    Tsotras, VJ
    Seeger, B
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 103 - 113
  • [7] Wander Join: Online Aggregation for Joins
    Li, Feifei
    Wu, Bin
    Yi, Ke
    Zhao, Zhuoyue
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2121 - 2124
  • [8] Using slice join for efficient evaluation of multi-way joins
    Lawrence, Ramon
    [J]. DATA & KNOWLEDGE ENGINEERING, 2008, 67 (01) : 118 - 139
  • [9] Evaluation of main memory join algorithms for joins with subset join predicates
    Helmer, S
    Moerkotte, G
    [J]. PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, 1997, : 386 - 395
  • [10] To Join or Not to Join? Thinking Twice about Joins before Feature Selection
    Kumar, Arun
    Naughton, Jeffrey
    Patel, Jignesh M.
    Zhu, Xiaojin
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 19 - 34