Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization

被引:0
|
作者
Ahmed Elliethy
Gaurav Sharma
机构
[1] University of Rochester,Department of Electrical and Computer Engineering
来源
关键词
Chamfer alignment; Pipelining; Parametric registration; GPU acceleration;
D O I
暂无
中图分类号
学科分类号
摘要
Parametric chamfer alignment (PChA) is commonly employed for aligning an observed set of points with a corresponding set of reference points. PChA estimates optimal geometric transformation parameters that minimize an objective function formulated as the sum of the squared distances from each transformed observed point to its closest reference point. A distance transform enables efficient computation of the (squared) distances, and the objective function minimization is commonly performed via the Levenberg–Marquardt (LM) nonlinear least squares iterative optimization algorithm. The point-wise computations of the objective function, gradient, and Hessian approximation required for the LM iterations make PChA computationally demanding for large-scale datasets. We propose an acceleration of the PChA via a parallelized and pipelined realization that is particularly well suited for large-scale datasets and for modern GPU architectures. Specifically, we partition the observed points among the GPU blocks and decompose the expensive LM calculations in correspondence with the GPU’s single-instruction multiple-thread architecture to significantly speed up this bottleneck step for PChA on large-scale datasets. Additionally, by reordering computations, we propose a novel pipelining of the LM algorithm that offers further speedup by exploiting the low arithmetic latency of the GPU compared with its high global memory access latency. Results obtained on two different platforms for both 2D and 3D large-scale point datasets from our ongoing research demonstrate that the proposed PChA GPU implementation provides a significant speedup over its single CPU counterpart.
引用
收藏
页码:1661 / 1680
页数:19
相关论文
共 50 条
  • [21] GPU-Accelerated Protein Sequence Alignment for Jamu Prediction
    Iryanto, Syam B.
    Kusuma, Wisnu A.
    Sadikin, Rifki
    Swardiana, I. Wayan A.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2017, : 132 - 136
  • [22] GPU Accelerated Scalable Parallel Decoding of LDPC Codes
    Wang, Guohui
    Wu, Michael
    Sun, Yang
    Cavallaro, Joseph R.
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 2053 - 2057
  • [23] GPU-accelerated parallel optimization for sparse regularization
    Wang, Xingran
    Liu, Tianyi
    Minh Trinh-Hoang
    Pesavento, Marius
    2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
  • [24] GPU Accelerated Molecular Docking with Parallel Genetic Algorithm
    Ouyang, Xuchang
    Kwoh, Chee Keong
    PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 694 - 695
  • [25] GPU-accelerated parallel algorithms for linear rankSVM
    Jing Jin
    Xianggao Cai
    Guoming Lai
    Xiaola Lin
    The Journal of Supercomputing, 2015, 71 : 4141 - 4171
  • [26] GPU-accelerated parallel algorithms for linear rankSVM
    Jin, Jing
    Cai, Xianggao
    Lai, Guoming
    Lin, Xiaola
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (11): : 4141 - 4171
  • [27] Impact of Asynchronism on GPU Accelerated Parallel Iterative Computations
    Contassot-Vivier, Sylvain
    Jost, Thomas
    Vialle, Stephane
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 43 - 53
  • [28] Accelerated GPU Computing Technology for Parallel Management Systems
    Cui, Feng
    Cheng, Changjian
    Wang, Feiyue
    Wei, Wei
    Li, Lefei
    Zou, Yumin
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5343 - 5347
  • [29] GPU Accelerated Parallel Implementation of Linear Programming Algorithms
    Saha, Ratul Kishore
    Pradhan, Ashutosh
    Ghosh, Tiash
    Jenamani, Mamata
    Singh, Sanjai Kumar
    Routray, Aurobinda
    INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 378 - 384
  • [30] Parallel processing for accelerated mean shift algorithm with GPU
    Chen, Jia
    Wu, Xiaojun
    Cai, Rong
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (03): : 461 - 466