Parallelization and Optimization of SIFT on GPU Using CUDA

被引:12
|
作者
Zhou, Yonglong [1 ]
Mei, Kuizhi [1 ]
Ji, Xiang [1 ]
Dong, Peixiang [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Peoples R China
关键词
D O I
10.1109/HPCC.and.EUC.2013.192
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scale-invariant feature transform (SIFT) based feature extraction algorithm is widely applied to extract features from images, and it is very attractive to accelerate these SIFT based algorithms on GPU. In this paper, we present several parallel computing strategies, implement and optimize the SIFT algorithm using CUDA programming model on GPU. Each stage of SIFT is analyzed in detail to choose the parallel strategy. On the basis of the elementary CUDA-SIFT and CUDA architecture, we optimize the implementation from several aspects to speedup the CUDA-SIFT. Experimental results demonstrate that our implementation after optimization is 2.5 times faster than previous optimization, and our CUDA based SIFT can run at the speed of 20 frames per second on most images with 1280x960 resolution in the test. Using 1920x1440 image to test, we have obtained a speed of 11 frames per second on average, which is about 60 times faster than the CPU implementation of SIFT. In short, our implementation obtains appropriate accuracy and higher efficiency compared to CPU implementations and other GPU implementations, which is attributed to our dedicated optimization strategies.
引用
收藏
页码:1351 / 1358
页数:8
相关论文
共 50 条
  • [1] GPU Parallelization of a Hybrid Pseudospectral Geophysical Turbulence Framework Using CUDA
    Rosenberg, Duane
    Mininni, Pablo D.
    Reddy, Raghu
    Pouquet, Annick
    [J]. ATMOSPHERE, 2020, 11 (02)
  • [2] Parallelization of Binary and Real-Coded Genetic Algorithms on GPU using CUDA
    Arora, Ramnik
    Tulshyan, Rupesh
    Deb, Kalyanmoy
    [J]. 2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [3] The Bernstein algorithm using the modified implicit Bernstein form and its GPU parallelization using CUDA
    Dhabe P.S.
    Nataraj P.S.V.
    [J]. International Journal of System Assurance Engineering and Management, 2017, 8 (04) : 826 - 841
  • [4] Using CUDA GPU to Accelerate the Ant Colony Optimization Algorithm
    Wei, Kai-Cheng
    Wu, Chao-Chin
    Wu, Chien-Ju
    [J]. 2013 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2013, : 90 - 95
  • [5] Improving Ant Colony Optimization performance on the GPU using CUDA
    Dawson, Laurence
    Stewart, Iain
    [J]. 2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 1901 - 1908
  • [6] Parallelization of Dynamic Programming in Nussinov RNA Folding Algorithm on the CUDA GPU
    Stojanovski, Marina Zaharieva
    Gjorgjevikj, Dejan
    Madjarov, Gjorgji
    [J]. ICT INNOVATIONS 2011, 2011, 150 : 279 - +
  • [7] Parallelization of calculations using GPU in optimization approach for macromodels construction
    Stakhiv, Petro
    Strubytska, Iryna
    Kozak, Yuriy
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (3A): : 7 - 9
  • [8] Parallelization of the Ant Colony Optimization for the Shortest Path Problem using OpenMP and CUDA
    Arnautovic, Maida
    Curic, Maida
    Dolamic, Emina
    Nosovic, Novica
    [J]. 2013 36TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2013, : 1273 - 1277
  • [9] Performance Evaluation of Optimization Algorithms based on GPU using CUDA Architecture
    Kawano, Yunkio
    Valdez, Fevrier
    Castillo, Oscar
    [J]. 2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,
  • [10] Optimization Principles and Application Performance Evaluation of a Multithreaded GPU Using CUDA
    Ryoo, Shane
    Rodrigues, Christopher I.
    Baghsorkhi, Sara S.
    Stone, Sam S.
    Kirk, David B.
    Hwu, Wen-mei W.
    [J]. PPOPP'08: PROCEEDINGS OF THE 2008 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2008, : 73 - 82