FPGA Acceleration of Approximate KNN Indexing on High-Dimensional Vectors

Authors
Danopoulos, Dimitrios [1]
Kachris, Christoforos [2,3]
Soudris, Dimitrios [1]
Affiliations
[1] NTUA, Dept Elect & Comp Engn, Athens, Greece
[2] Democritus Univ Thrace, Athens, Greece
[3] NTUA, ICCS, Athens, Greece
Keywords
approximate KNN; nearest neighbor index; machine learning; FPGA; hardware accelerator
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Accurate and efficient machine learning algorithms are vital to many problems, especially classification and clustering tasks. One of the most important algorithms for similarity search is the K-Nearest Neighbor (KNN) algorithm, which is widely adopted for predictive analysis, text categorization, image recognition, and similar tasks, but comes at a high computational cost. Large companies that process big data in modern data centers adopt this technique, combined with algorithm-level approximations, to compute critical workloads every second. However, high-dimensional nearest neighbor queries add significant computation and energy overhead. In this paper, we deploy a hardware-accelerated approximate KNN algorithm built on the FAISS framework (Facebook Artificial Intelligence Similarity Search) using FPGA-OpenCL platforms. The FPGA architecture in this framework addresses the problem of vector indexing when training on and adding large-scale, high-dimensional data. The proposed solution uses an in-memory FPGA format that outperforms other high-performance systems in terms of speed and energy efficiency. The experiments were run on a Xilinx Alveo U200 FPGA, achieving up to 115x accelerator-only speed-up over a single-core CPU and 2.4x end-to-end system speed-up over a 36-thread Xeon CPU. In addition, the performance per watt of the design was 4.1x that of the same CPU and 1.4x that of a Kepler-class GPU.
Pages: 59-65
Page count: 7