Parallelization and Optimization of SIFT on GPU Using CUDA

被引:12
|
作者
Zhou, Yonglong [1 ]
Mei, Kuizhi [1 ]
Ji, Xiang [1 ]
Dong, Peixiang [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Peoples R China
关键词
D O I
10.1109/HPCC.and.EUC.2013.192
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scale-invariant feature transform (SIFT) based feature extraction algorithm is widely applied to extract features from images, and it is very attractive to accelerate these SIFT based algorithms on GPU. In this paper, we present several parallel computing strategies, implement and optimize the SIFT algorithm using CUDA programming model on GPU. Each stage of SIFT is analyzed in detail to choose the parallel strategy. On the basis of the elementary CUDA-SIFT and CUDA architecture, we optimize the implementation from several aspects to speedup the CUDA-SIFT. Experimental results demonstrate that our implementation after optimization is 2.5 times faster than previous optimization, and our CUDA based SIFT can run at the speed of 20 frames per second on most images with 1280x960 resolution in the test. Using 1920x1440 image to test, we have obtained a speed of 11 frames per second on average, which is about 60 times faster than the CPU implementation of SIFT. In short, our implementation obtains appropriate accuracy and higher efficiency compared to CPU implementations and other GPU implementations, which is attributed to our dedicated optimization strategies.
引用
收藏
页码:1351 / 1358
页数:8
相关论文
共 50 条
  • [21] cuPSO: GPU Parallelization for Particle Swarm Optimization Algorithms
    Wang, Chuan-Chi
    Ho, Chun-Yen
    Tu, Chia-Heng
    Hung, Shih-Hao
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1183 - 1189
  • [22] GPU-Based Parallelization for Fast Circuit Optimization
    Liu, Yifang
    Hu, Jiang
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2011, 16 (03)
  • [23] SIFT Implementation and Optimization for General-Purpose GPU
    Heymann, S.
    Mueller, K.
    Smolic, A.
    Froelich, B.
    Wiegand, T.
    WSCG 2007, FULL PAPERS PROCEEDINGS I AND II, 2007, : 317 - +
  • [24] Accelerating Ant Colony Optimization-based Edge Detection on the GPU using CUDA
    Dawson, Laurence
    Stewart, Iain A.
    2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 1736 - 1743
  • [25] Multiple string matching on a GPU using CUDA
    Kouzinopoulos, Charalampos S.
    Michailidis, Panagiotis D.
    Margaritis, Konstantinos G.
    Scalable Computing, 2015, 16 (02): : 121 - 137
  • [26] Canny Edge Detection on GPU using CUDA
    Horvath, Matthew, Jr.
    Bowers, Michael
    Alawneh, Shadi
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 419 - 425
  • [27] MULTIPLE STRING MATCHING ON A GPU USING CUDA
    Kouzinopoulos, Charalampos S.
    Michailidis, Panagiotis D.
    Margaritis, Konstantinos G.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2015, 16 (02): : 121 - 137
  • [28] GPU Acceleration using CUDA for Computational Electromagnetics
    Sideris, Constantine
    2024 INTERNATIONAL APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY SYMPOSIUM, ACES 2024, 2024,
  • [29] Singular Value Decomposition on GPU using CUDA
    Lahabar, Sheetal
    Narayanan, P. J.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 840 - 849
  • [30] GPU Acceleration of PROPELLER MRI Using CUDA
    Guo, Hongyu
    Dai, Jianping
    Guo, Hongyu
    He, Yanfa
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 2051 - +