Combining CPU and GPU architectures for fast similarity search

Authors
Martin Kruliš
Tomáš Skopal
Jakub Lokoč
Christian Beecks
Affiliations
[1] Charles University in Prague, SIRET Research Group, Faculty of Mathematics and Physics
[2] RWTH Aachen University, Data Management and Data Exploration Group
Keywords
Similarity search; Database indexing; Parallel computing; GPU; Pivot table; Metric; Ptolemaic; Multimedia databases
Abstract
The Signature Quadratic Form Distance (SQFD) on feature signatures is a flexible distance-based similarity model for effective content-based multimedia retrieval. Although metric indexing approaches can speed up query processing by two orders of magnitude, applying them to large-scale multimedia databases containing billions of images remains challenging. In this paper, we propose a parallel approach that balances the utilization of the CPU and many-core GPUs for efficient similarity search with the SQFD. In particular, we show how to process multiple distance computations and other parts of the search procedure in parallel, achieving maximal performance of the combined CPU/GPU system. The experimental evaluation demonstrates that our approach, implemented on a common workstation with two GPU cards, outperforms a traditional parallel implementation on a high-end 48-core NUMA server by almost an order of magnitude in terms of efficiency. If we also consider that the high-end server costs ten times as much as the GPU workstation, then the GPU-based similarity search beats the CPU-based solution by almost two orders of magnitude in price/performance ratio. Although proposed for the SQFD, our approach to fast GPU-based similarity search is applicable to any distance function that can be parallelized efficiently in the SIMT execution model.
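The abstract does not give the SQFD formula, so the following is only a minimal sequential sketch based on the commonly cited definition: the two signatures' centroids are concatenated, the second signature's weights are negated, and the distance is the square root of the quadratic form over a centroid similarity matrix. The Gaussian similarity function, the `alpha` parameter, and the names `gaussian_similarity` and `sqfd` are assumptions made for illustration, not details taken from the paper.

```python
import math


def gaussian_similarity(a, b, alpha=1.0):
    """Assumed similarity function: exp(-alpha * squared Euclidean distance)."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-alpha * sq_dist)


def sqfd(sig_q, sig_o, alpha=1.0):
    """SQFD between two signatures, each a list of (centroid, weight) pairs."""
    # Concatenate the centroids and negate the weights of the second signature.
    centroids = [c for c, _ in sig_q] + [c for c, _ in sig_o]
    weights = [w for _, w in sig_q] + [-w for _, w in sig_o]

    # Quadratic form w * A * w^T, with A[i][j] computed on the fly.
    # Every (i, j) term is independent of the others, which is the property
    # that makes the computation a natural fit for data-parallel hardware.
    total = 0.0
    for ci, wi in zip(centroids, weights):
        for cj, wj in zip(centroids, weights):
            total += wi * wj * gaussian_similarity(ci, cj, alpha)

    # Clamp tiny negative values caused by floating-point rounding.
    return math.sqrt(max(total, 0.0))


# Hypothetical two-dimensional signatures used only to exercise the function.
sig_a = [((0.0, 0.0), 0.5), ((1.0, 1.0), 0.5)]
sig_b = [((0.0, 0.0), 1.0)]
distance_ab = sqfd(sig_a, sig_b)
```

The double loop performs a number of similarity evaluations that grows quadratically with the combined signature size. Since each term is independent, this is presumably the kind of work that can be spread across GPU threads, although the paper's actual kernel design is not described in this record.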
Pages: 179-207
Page count: 28