cuRCD: Region covariance descriptor CUDA implementation

Authors
M. Ali Asan
Adnan Ozsoy
Affiliation
[1] Hacettepe University, Computer Engineering Department
Keywords
Parallel region covariance; CUDA; Real-time object detection; GPGPU
DOI
Not available
Abstract
Region covariance is a robust feature descriptor that combines even the simplest image features, such as intensity and gradient, into a well-performing descriptor for image regions. Beyond its robustness, it requires many identical, heavy computations on different parts of the input data, which makes it a good candidate for parallel execution. In this manuscript, we present a real-time parallel implementation of region covariance which, to the best of our knowledge, is the first in the literature. We compared it against existing implementations and achieved execution 6 times faster than a vectorized parallel CPU implementation, which provides the speedup necessary for real-time processing. Additionally, we improved the existing integral image calculation method on CUDA, reducing memory usage by 50% and achieving the fastest computation speed among existing solutions, and we improved covariance matrix comparison by using a distance metric that is lightweight to compute and easy to implement.
Pages: 19737-19751
Number of pages: 14