Combining CPU and GPU architectures for fast similarity search

被引:0
|
作者
Martin Kruliš
Tomáš Skopal
Jakub Lokoč
Christian Beecks
机构
[1] Charles University in Prague,SIRET Research Group, Faculty of Mathematics and Physics
[2] RWTH Aachen University,Data Management and Data Exploration Group
来源
关键词
Similarity search; Database indexing; Parallel computing; GPU; Pivot table; Metric; Ptolemaic; Multimedia databases;
D O I
暂无
中图分类号
学科分类号
摘要
The Signature Quadratic Form Distance on feature signatures represents a flexible distance-based similarity model for effective content-based multimedia retrieval. Although metric indexing approaches are able to speed up query processing by two orders of magnitude, their applicability to large-scale multimedia databases containing billions of images is still a challenging issue. In this paper, we propose a parallel approach that balances the utilization of CPU and many-core GPUs for efficient similarity search with the Signature Quadratic Form Distance. In particular, we show how to process multiple distance computations and other parts of the search procedure in parallel, achieving maximal performance of the combined CPU/GPU system. The experimental evaluation demonstrates that our approach implemented on a common workstation with 2 GPU cards outperforms traditional parallel implementation on a high-end 48-core NUMA server in terms of efficiency almost by an order of magnitude. If we consider also the price of the high-end server that is ten times higher than that of the GPU workstation then, based on price/performance ratio, the GPU-based similarity search beats the CPU-based solution by almost two orders of magnitude. Although proposed for the SQFD, our approach of fast GPU-based similarity search is applicable for any distance function that is efficiently parallelizable in the SIMT execution model.
引用
收藏
页码:179 / 207
页数:28
相关论文
共 50 条
  • [31] Effectiveness of Fast Fourier Transform Implementations on GPU and CPU
    Puchala, Dariusz
    Stokfiszewski, Kamil
    Yatsymirskyy, Mykhaylo
    Szczepaniak, Bartlomiej
    2015 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL PROBLEMS OF ELECTRICAL ENGINEERING (CPEE), 2015, : 162 - 164
  • [32] Fast GPU and CPU computing for Head Position Estimation
    Szkudlarek, Michal
    Pietruszka, Maria
    PROCEEDINGS OF THE 2015 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 5 : 231 - 240
  • [33] Fast CPU/GPU Pattern Evaluation of Irregular Arrays
    Capozzoli, A.
    Curcio, C.
    D'Elia, G.
    Liseno, A.
    Vinetti, P.
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2010, 25 (04): : 355 - 372
  • [34] Financial applications on multi-CPU and multi-GPU architectures
    Castillo, Emilio
    Camarero, Cristobal
    Borrego, Ana
    Luis Bosque, Jose
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (02): : 729 - 739
  • [35] Financial applications on multi-CPU and multi-GPU architectures
    Emilio Castillo
    Cristóbal Camarero
    Ana Borrego
    Jose Luis Bosque
    The Journal of Supercomputing, 2015, 71 : 729 - 739
  • [36] Comparing LLC-Memory Traffic between CPU and GPU Architectures
    Monil, Mohammad Alaul Haque
    Lee, Seyong
    Vetter, Jeffrey S.
    Malony, Allen D.
    PROCEEDINGS OF RSDHA 2021: REDEFINING SCALABILITY FOR DIVERSELY HETEROGENEOUS ARCHITECTURES, 2021, : 8 - 16
  • [37] Parallel Aligning Multiple Metabolic Pathways on Hybrid CPU and GPU Architectures
    Huang, Yiran
    Zhong, Cheng
    Zhang, Jinxiong
    Li, Ye
    Liu, Jun
    PARALLEL ARCHITECTURE, ALGORITHM AND PROGRAMMING, PAAP 2017, 2017, 729 : 483 - 492
  • [38] Correction to: Distributed out-of-memory NMF on CPU/GPU architectures
    Ismael Boureima
    Manish Bhattarai
    Maksim Eren
    Erik Skau
    Philip Romero
    Stephan Eidenbenz
    Boian Alexandrov
    The Journal of Supercomputing, 2024, 80 : 5731 - 5732
  • [39] Automatic CUDA Code Synthesis Framework for Multicore CPU and GPU Architectures
    Jung, Hanwoong
    Yi, Youngmin
    Ha, Soonhoi
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2012, 7203 : 579 - 588
  • [40] Understanding Co-Running Behaviors on Integrated CPU/GPU Architectures
    Zhang, Feng
    Zhai, Jidong
    He, Bingsheng
    Zhang, Shuhao
    Chen, Wenguang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (03) : 905 - 918