Fast joins using join indices

被引:32
|
作者
Li, Z [1 ]
Ross, KA [1 ]
机构
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
来源
VLDB JOURNAL | 1999年 / 8卷 / 01期
关键词
query processing; decision support systems;
D O I
10.1007/s007780050071
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Two new algorithms, "Jive join" and "Slam join," are proposed for computing the join of two relations using a join index. The algorithms are duals: Jive join range-partitions input relation tuple ids and then processes each partition, while Slam join forms ordered runs of input relation tuple ids and then merges the results. Both algorithms make a single sequential pass through each input relation, in addition to one pass through the join index and two passes through a temporary file, whose size is half that of the join index. Both algorithms require only that the number of blocks in main memory is of the order of the square root of the number of blocks in the smaller relation. By storing intermediate and final join results in a vertically partitioned fashion, our algorithms need to manipulate less data in memory at a given time than other algorithms. The algorithms are resistant to data skew and adaptive to memory fluctuations. Selection conditions can be incorporated into the algorithms. Using a detailed cost model, the algorithms are analyzed and compared with competing algorithms. For large input relations, our algorithms perform significantly better than Valduriez's algorithm, the TID join algorithm, and hash join algorithms. An experimental study is also conducted to validate the analytical results and to demonstrate the performance characteristics of each algorithm in practice.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 50 条
  • [41] Fast Computation of Stock Market Indices Using GPUs
    Induwara, Supun
    Jayasena, Sanath
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 237 - 241
  • [42] Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints
    Xiao, Chuan
    Wang, Wei
    Lin, Xuemin
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 933 - 944
  • [43] Fast Interval Joins for Temporal SPARQL Queries
    Chekol, Melisachew Wudage
    Pirro, Giuseppe
    Stuckenschmidt, Heiner
    COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, : 1148 - 1154
  • [44] Technical Perspective: Data Distribution for Fast Joins
    Libkin, Leonid
    SIGMOD RECORD, 2016, 45 (01) : 32 - 32
  • [45] Fast and scalable vector similarity joins with MapReduce
    Byoungju Yang
    Hyun Joon Kim
    Junho Shim
    Dongjoo Lee
    Sang-goo Lee
    Journal of Intelligent Information Systems, 2016, 46 : 473 - 497
  • [46] Lightning Fast and Space Efficient Inequality Joins
    Khayyat, Zuhair
    Lucia, William
    Singh, Meghna
    Ouzzani, Mourad
    Papotti, Paolo
    Quiane-Ruiz, Jorge-Arnulfo
    Tang, Nan
    Kalnis, Panos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (13): : 2074 - 2085
  • [47] Fast and scalable vector similarity joins with MapReduce
    Yang, Byoungju
    Kim, Hyun Joon
    Shim, Junho
    Lee, Dongjoo
    Lee, Sang-goo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (03) : 473 - 497
  • [48] Baltic Design Centre joins in the fast lane
    不详
    NAVAL ARCHITECT, 2007, : 93 - 93
  • [49] RCA JOINS THE FAST-CMOS FRAY
    SPADARO, JJ
    ELECTRONIC PRODUCTS MAGAZINE, 1986, 28 (24): : 24 - 24
  • [50] RayJoin: Fast and Precise Spatial Join
    Geng, Liang
    Lee, Rubao
    Zhang, Xiaodong
    PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024, 2024, : 124 - 136