Combining CPU and GPU architectures for fast similarity search

被引:0
|
作者
Martin Kruliš
Tomáš Skopal
Jakub Lokoč
Christian Beecks
机构
[1] Charles University in Prague,SIRET Research Group, Faculty of Mathematics and Physics
[2] RWTH Aachen University,Data Management and Data Exploration Group
来源
关键词
Similarity search; Database indexing; Parallel computing; GPU; Pivot table; Metric; Ptolemaic; Multimedia databases;
D O I
暂无
中图分类号
学科分类号
摘要
The Signature Quadratic Form Distance on feature signatures represents a flexible distance-based similarity model for effective content-based multimedia retrieval. Although metric indexing approaches are able to speed up query processing by two orders of magnitude, their applicability to large-scale multimedia databases containing billions of images is still a challenging issue. In this paper, we propose a parallel approach that balances the utilization of CPU and many-core GPUs for efficient similarity search with the Signature Quadratic Form Distance. In particular, we show how to process multiple distance computations and other parts of the search procedure in parallel, achieving maximal performance of the combined CPU/GPU system. The experimental evaluation demonstrates that our approach implemented on a common workstation with 2 GPU cards outperforms traditional parallel implementation on a high-end 48-core NUMA server in terms of efficiency almost by an order of magnitude. If we consider also the price of the high-end server that is ten times higher than that of the GPU workstation then, based on price/performance ratio, the GPU-based similarity search beats the CPU-based solution by almost two orders of magnitude. Although proposed for the SQFD, our approach of fast GPU-based similarity search is applicable for any distance function that is efficiently parallelizable in the SIMT execution model.
引用
收藏
页码:179 / 207
页数:28
相关论文
共 50 条
  • [11] Mapping Parallel Programs to Heterogeneous CPU/GPU Architectures using a Monte Carlo Tree Search
    Goli, Mehdi
    McCall, John
    Brown, Christopher
    Janjic, Vladimir
    Hammond, Kevin
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 2932 - 2939
  • [12] Scalable molecular dynamics on CPU and GPU architectures with NAMD
    Phillips, James C.
    Hardy, David J.
    Maia, Julio D. C.
    Stone, John E.
    Ribeiro, Joao, V
    Bernardi, Rafael C.
    Buch, Ronak
    Fiorin, Giacomo
    Henin, Jerome
    Jiang, Wei
    McGreevy, Ryan
    Melo, Marcelo C. R.
    Radak, Brian K.
    Skeel, Robert D.
    Singharoy, Abhishek
    Wang, Yi
    Roux, Benoit
    Aksimentiev, Aleksei
    Luthey-Schulten, Zaida
    Kale, Laxmikant, V
    Schulten, Klaus
    Chipot, Christophe
    Tajkhorshid, Emad
    JOURNAL OF CHEMICAL PHYSICS, 2020, 153 (04):
  • [13] Distributed Learning of CNNs on Heterogeneous CPU/GPU Architectures
    Marques, Jose
    Falcao, Gabriel
    Alexandre, Luis A.
    APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (9-10) : 822 - 844
  • [14] A Survey on Heterogeneous CPU-GPU Architectures and Simulators
    Alaei, Mohammad
    Yazdanpanah, Fahimeh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (01):
  • [15] Denial of Service in CPU-GPU Heterogeneous Architectures
    Wen, Hao
    Zhang, Wei
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [16] Reducing CPU-GPU Interferences to Improve CPU Performance in Heterogeneous Architectures
    Wen H.
    Zhang W.
    Journal of Computing Science and Engineering, 2020, 16 (04) : 131 - 145
  • [17] Sorting Large Datasets with Heterogeneous CPU/GPU Architectures
    Gowanlock, Michael
    Karsin, Ben
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 560 - 569
  • [18] On-The-FlyWorkload Partitioning for Integrated CPU/GPU Architectures
    Cho, Younghyun
    Negele, Florian
    Park, Seohong
    Egger, Bernhard
    Gross, Thomas R.
    27TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2018), 2018,
  • [19] A Sample-Based Dynamic CPU and GPU LLC Bypassing Method for Heterogeneous CPU-GPU Architectures
    Wang, Xin
    Zhang, Wei
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS / 11TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING / 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2017, : 753 - 760
  • [20] Fast Bed Interpolation Algoritm on CPU and GPU
    Zhang, Yaoxin
    Jia, Yafei
    WORLD ENVIRONMENTAL AND WATER RESOURCES CONGRESS 2019: HYDRAULICS, WATERWAYS, AND WATER DISTRIBUTION SYSTEMS ANALYSIS, 2019, : 208 - 220