Adaptive algorithms for set containment joins

被引:29
|
作者
Melnik, S
Garcia-Molina, H
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Univ Leipzig, Dept Comp Sci, D-04109 Leipzig, Germany
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2003年 / 28卷 / 01期
关键词
algorithms; experimentation; performance;
D O I
10.1145/762471.762474
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A set containment join is a join between set-valued attributes of two relations, whose join condition is specified using the subset (subset of or equal to) operator. Set containment joins are deployed in many database applications, even those that do not support set-valued attributes. In this article, we propose two novel partitioning algorithms, called the Adaptive Pick-and-Sweep Join (APSJ) and the Adaptive Divide-and-Conquer Join (ADCJ), which allow computing set containment joins efficiently. We show that APSJ outperforms previously suggested algorithms for many data sets, often by an order of magnitude. We present a detailed analysis of the algorithms and study their performance on real and synthetic data using an implemented testbed.
引用
下载
收藏
页码:56 / 99
页数:44
相关论文
共 50 条
  • [41] Set-membership adaptive kernel NLMS algorithms: Design and analysis
    Flores, Andre
    de Lamare, Rodrigo C.
    SIGNAL PROCESSING, 2019, 154 : 1 - 14
  • [42] Top-k Set Similarity Joins
    Xiao, Chuan
    Wang, Wei
    Lin, Xuemin
    Shang, Haichuan
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 916 - +
  • [43] On the complexity of division and set joins in the relational algebra
    Leinders, Dirk
    Van den Bussche, Jan
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2007, 73 (04) : 538 - 549
  • [44] Overlap Set Similarity Joins with Theoretical Guarantees
    Deng, Dong
    Tao, Yufei
    Li, Guoliang
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 905 - 920
  • [45] AdaptDB: Adaptive Partitioning for Distributed Joins
    Lu, Yi
    Shanbhag, Anil
    Jindal, Alekh
    Madden, Samuel
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05): : 589 - 600
  • [46] Set Similarity Joins on MapReduce: An Experimental Survey
    Fier, Fabian
    Augsten, Nikolaus
    Bouros, Panagiotis
    Leser, Ulf
    Freytag, Johann-Christoph
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): : 1110 - 1122
  • [47] A SEYFERT-GALAXY JOINS THE JET SET
    WILLS, BJ
    NATURE, 1985, 313 (6005) : 741 - 741
  • [48] Adaptive load diffusion for stream joins
    Gu, XH
    Yu, PS
    MIDDLEWARE 2005, PROCEEDINGS, 2005, 3790 : 411 - 420
  • [49] Selectivity Estimation on Set Containment Search
    Yang, Yang
    Zhang, Wenjie
    Zhang, Ying
    Lin, Xuemin
    Wang, Liping
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 330 - 349
  • [50] Set containment characterization for quasiconvex programming
    Suzuki, Satoshi
    Kuroiwa, Daishi
    JOURNAL OF GLOBAL OPTIMIZATION, 2009, 45 (04) : 551 - 563