FPGA Acceleration of Approximate KNN Indexing on High-Dimensional Vectors

Authors
Danopoulos, Dimitrios [1]
Kachris, Christoforos [2, 3]
Soudris, Dimitrios [1]
Affiliations
[1] NTUA, Dept Elect & Comp Engn, Athens, Greece
[2] Democritus Univ Thrace, Athens, Greece
[3] NTUA, ICCS, Athens, Greece
Keywords
approximate KNN; nearest neighbor index; machine learning; FPGA; hardware accelerator
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Accurate and efficient machine learning algorithms are vital to many problems, especially classification and clustering tasks. One of the most important algorithms for similarity search is the K-Nearest Neighbor (KNN) algorithm, which is widely adopted for predictive analysis, text categorization, image recognition, and similar tasks, but comes at a high computational cost. Large companies that process big data in modern data centers combine this technique with algorithm-level approximations in order to compute critical workloads every second. However, high-dimensional nearest neighbor queries add a significant further computation and energy overhead. In this paper, we deploy a hardware-accelerated approximate KNN algorithm built upon the FAISS framework (Facebook Artificial Intelligence Similarity Search) using FPGA-OpenCL platforms. The FPGA architecture in this framework addresses the problem of vector indexing, covering the training and adding stages for large-scale, high-dimensional data. The proposed solution uses an in-memory FPGA format that outperforms other high-performance systems in terms of speed and energy efficiency. The experiments were run on a Xilinx Alveo U200 FPGA, achieving up to 115x accelerator-only speed-up over a single-core CPU and 2.4x end-to-end system speed-up over a 36-thread Xeon CPU. In addition, the performance per watt of the design was 4.1x that of the same CPU and 1.4x that of a Kepler-class GPU.
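The "training" and "adding" stages of vector indexing named in the abstract can be illustrated with a small sketch. It assumes an inverted-file style index (coarse k-means centroids plus one list of vectors per centroid), which the abstract does not specify. All function names here (`train`, `add`, `search`) are hypothetical and written in plain Python. This is neither the FAISS API nor the authors' FPGA design, only a toy model of the kind of work such an index performs.

```python
import random

def sq_dist(a, b):
    # Squared Euclidean distance between two equal-length vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(centroids, v):
    # Index of the centroid closest to v.
    return min(range(len(centroids)), key=lambda c: sq_dist(centroids[c], v))

def train(vectors, nlist, iters=10, seed=0):
    # "Training" stage (assumed): a few k-means iterations to get nlist centroids.
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, nlist)]
    for _ in range(iters):
        buckets = [[] for _ in range(nlist)]
        for v in vectors:
            buckets[nearest(centroids, v)].append(v)
        for c, members in enumerate(buckets):
            if members:  # keep the old centroid if a cluster went empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def add(centroids, vectors):
    # "Adding" stage (assumed): store each vector in the list of its nearest centroid.
    lists = [[] for _ in centroids]
    for i, v in enumerate(vectors):
        lists[nearest(centroids, v)].append((i, v))
    return lists

def search(centroids, lists, q, k, nprobe):
    # Approximate query: scan only the nprobe lists whose centroids are closest to q.
    order = sorted(range(len(centroids)), key=lambda c: sq_dist(centroids[c], q))
    cands = [(sq_dist(v, q), i) for c in order[:nprobe] for i, v in lists[c]]
    return [i for _, i in sorted(cands)[:k]]

# Synthetic data: 500 random 16-dimensional vectors.
rng = random.Random(42)
data = [[rng.random() for _ in range(16)] for _ in range(500)]

centroids = train(data, nlist=8)
lists = add(centroids, data)

query = data[0]
approx = search(centroids, lists, query, k=5, nprobe=2)  # scans 2 of 8 lists
exact = search(centroids, lists, query, k=5, nprobe=8)   # scans every list
```

Both the training and adding loops are dominated by distance computations between every input vector and every centroid. That cost grows with the dimensionality and the dataset size, which is why these stages are natural candidates for hardware acceleration.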
Pages: 59-65
Number of pages: 7