Progressive high-dimensional similarity join

被引:0
|
作者
Tok, Wee Hyong [1 ]
Bressan, Stephane [1 ]
Lee, Mong-Li [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Rate-Based Progressive Join (RPJ) is a non-blocking relational equijoin algorithm. It is an equijoin that can deliver results progressively. In this paper, we first present a naive extension, called neRPJ, to the progressive computation of the similarity join of high-dimensional data. We argue that this naive extension is not suitable. We therefore propose an adequate solution in the form of a Result-Rate Progressive Join (RRPJ) for high-dimensional distance similarity joins. Using both synthetic and real-life datasets, we empirically show that RRPJ is effective and efficient, and outperforms the naive extension.
引用
收藏
页码:233 / +
页数:2
相关论文
共 50 条
  • [1] A novel approach for high-dimensional vector similarity join query
    Ma, Youzhong
    Jia, Shijie
    Zhang, Yongxin
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (05):
  • [2] Epsilon grid order:: An algorithm for the similarity join on massive high-dimensional data
    Böhm, C
    Braunmüller, B
    Krebs, F
    Kriege, HP
    [J]. SIGMOD RECORD, 2001, 30 (02) : 379 - 388
  • [3] PHiDJ: Parallel Similarity Self-Join for High-Dimensional Vector Data with MapReduce
    Fries, Sergej
    Boden, Brigitte
    Stepien, Grzegorz
    Seidl, Thomas
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 796 - 807
  • [4] High-dimensional similarity joins
    Shim, K
    Srikant, R
    Agrawal, R
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 156 - 171
  • [5] High-dimensional similarity joins
    Shim, K
    Srikant, R
    Agrawal, R
    [J]. 13TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING - PROCEEDINGS, 1997, : 301 - 311
  • [6] Projection Based Large Scale High-Dimensional Data Similarity Join Using MapReduce Framework
    Ma, Youzhong
    Zhang, Ruiling
    Cui, Zhanyou
    Lin, Chunjie
    [J]. IEEE ACCESS, 2020, 8 : 121665 - 121677
  • [7] An efficient similarity join approach on large-scale high-dimensional data using random projection
    Ma, Youzhong
    Zhang, Ruiling
    Jia, Shijie
    Zhang, Yongxin
    Meng, Xiaofeng
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (20):
  • [8] k Nearest Neighbor Similarity Join Algorithm on High-Dimensional Data Using Novel Partitioning Strategy
    Ma, Youzhong
    Hua, Qiaozhi
    Wen, Zheng
    Zhang, Ruiling
    Zhang, Yongxin
    Li, Haipeng
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [9] High-dimensional similarity retrieval using dimensional choice
    Tahmoush, Dave
    Samet, Hanan
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 490 - 497
  • [10] High-dimensional similarity retrieval using dimensional choice
    Tahmoush, Dave
    Samet, Hanan
    [J]. SISAP 2008: FIRST INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2008, : 35 - 42