High-dimensional kNN joins with incremental updates

被引:46
|
作者
Yu, Cui [1 ]
Zhang, Rui [2 ]
Huang, Yaochun [3 ]
Xiong, Hui [4 ]
机构
[1] Monmouth Univ, W Long Branch, NJ 07764 USA
[2] Univ Melbourne, Carlton, Vic 3053, Australia
[3] Univ Texas Dallas, Dallas, TX 75080 USA
[4] Rutgers State Univ, Newark, NJ 07102 USA
关键词
Query optimization; Storage & access; Optimization and performance;
D O I
10.1007/s10707-009-0076-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The k Nearest Neighbor (kNN) join operation associates each data object in one data set with its k nearest neighbors from the same or a different data set. The kNN join on high-dimensional data (high-dimensional kNN join) is a very expensive operation. Existing high-dimensional kNN join algorithms were designed for static data sets and therefore cannot handle updates efficiently. In this article, we propose a novel kNN join method, named kNNJoin(+), which supports efficient incremental computation of kNN join results with updates on high-dimensional data. As a by-product, our method also provides answers for the reverse kNN queries with very little overhead. We have performed an extensive experimental study. The results show the effectiveness of kNNJoin(+) for processing high-dimensional kNN joins in dynamic workloads.
引用
收藏
页码:55 / 82
页数:28
相关论文
共 50 条
  • [1] High-dimensional kNN joins with incremental updates
    Cui Yu
    Rui Zhang
    Yaochun Huang
    Hui Xiong
    [J]. GeoInformatica, 2010, 14 : 55 - 82
  • [2] High-dimensional similarity joins
    Shim, K
    Srikant, R
    Agrawal, R
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 156 - 171
  • [3] High-dimensional similarity joins
    Shim, K
    Srikant, R
    Agrawal, R
    [J]. 13TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING - PROCEEDINGS, 1997, : 301 - 311
  • [4] Interpretable SAM-kNN Regressor for Incremental Learning on High-Dimensional Data Streams
    Jakob, Jonathan
    Artelt, Andre
    Hasenjaeger, Martina
    Hammer, Barbara
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [5] Parallel algorithms for high-dimensional proximity joins
    Shafer, JC
    Agrawal, R
    [J]. PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, 1997, : 176 - 185
  • [6] Similarity joins for high-dimensional data using Spark
    Rong, Chuitian
    Cheng, Xiaohai
    Chen, Ziliang
    Huo, Na
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (20):
  • [7] Boosting for Vote Learning in High-dimensional kNN Classification
    Tomasev, Nenad
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 676 - 683
  • [8] Incremental Extraction of High-Dimensional Equivalence Structures
    Satoh, Seiya
    Yamakawa, Hiroshi
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1518 - 1524
  • [9] Efficient kNN Join over Dynamic High-Dimensional Data
    Ukey, Nimish
    Yang, Zhengyi
    Zhang, Guangjian
    Liu, Boge
    Li, Binghao
    Zhang, Wenjie
    [J]. DATABASES THEORY AND APPLICATIONS (ADC 2022), 2022, 13459 : 63 - 75
  • [10] FPGA Acceleration of Approximate KNN Indexing on High-Dimensional Vectors
    Danopoulos, Dimitrios
    Kachris, Christoforos
    Soudris, Dimitrios
    [J]. 2019 14TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC 2019), 2019, : 59 - 65