Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization

被引：0

作者：

Ahmed Elliethy

Gaurav Sharma

机构：

[1] University of Rochester,Department of Electrical and Computer Engineering

来源：

Journal of Real-Time Image Processing | 2019年 / 16卷

关键词：

Chamfer alignment; Pipelining; Parametric registration; GPU acceleration;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Parametric chamfer alignment (PChA) is commonly employed for aligning an observed set of points with a corresponding set of reference points. PChA estimates optimal geometric transformation parameters that minimize an objective function formulated as the sum of the squared distances from each transformed observed point to its closest reference point. A distance transform enables efficient computation of the (squared) distances, and the objective function minimization is commonly performed via the Levenberg–Marquardt (LM) nonlinear least squares iterative optimization algorithm. The point-wise computations of the objective function, gradient, and Hessian approximation required for the LM iterations make PChA computationally demanding for large-scale datasets. We propose an acceleration of the PChA via a parallelized and pipelined realization that is particularly well suited for large-scale datasets and for modern GPU architectures. Specifically, we partition the observed points among the GPU blocks and decompose the expensive LM calculations in correspondence with the GPU’s single-instruction multiple-thread architecture to significantly speed up this bottleneck step for PChA on large-scale datasets. Additionally, by reordering computations, we propose a novel pipelining of the LM algorithm that offers further speedup by exploiting the low arithmetic latency of the GPU compared with its high global memory access latency. Results obtained on two different platforms for both 2D and 3D large-scale point datasets from our ongoing research demonstrate that the proposed PChA GPU implementation provides a significant speedup over its single CPU counterpart.

引用

页码：1661 / 1680

页数：19

共 50 条

[21] GPU-Accelerated Protein Sequence Alignment for Jamu Prediction
Iryanto, Syam B.
Kusuma, Wisnu A.
Sadikin, Rifki
Swardiana, I. Wayan A.
2017 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2017, : 132 - 136
[22] GPU Accelerated Scalable Parallel Decoding of LDPC Codes
Wang, Guohui
Wu, Michael
Sun, Yang
Cavallaro, Joseph R.
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 2053 - 2057
[23] GPU-accelerated parallel optimization for sparse regularization
Wang, Xingran
Liu, Tianyi
Minh Trinh-Hoang
Pesavento, Marius
2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
[24] GPU Accelerated Molecular Docking with Parallel Genetic Algorithm
Ouyang, Xuchang
Kwoh, Chee Keong
PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012), 2012, : 694 - 695
[25] GPU-accelerated parallel algorithms for linear rankSVM
Jing Jin
Xianggao Cai
Guoming Lai
Xiaola Lin
The Journal of Supercomputing, 2015, 71 : 4141 - 4171
[26] GPU-accelerated parallel algorithms for linear rankSVM
Jin, Jing
Cai, Xianggao
Lai, Guoming
Lin, Xiaola
JOURNAL OF SUPERCOMPUTING, 2015, 71 (11): : 4141 - 4171
[27] Impact of Asynchronism on GPU Accelerated Parallel Iterative Computations
Contassot-Vivier, Sylvain
Jost, Thomas
Vialle, Stephane
APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 43 - 53
[28] Accelerated GPU Computing Technology for Parallel Management Systems
Cui, Feng
Cheng, Changjian
Wang, Feiyue
Wei, Wei
Li, Lefei
Zou, Yumin
2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5343 - 5347
[29] GPU Accelerated Parallel Implementation of Linear Programming Algorithms
Saha, Ratul Kishore
Pradhan, Ashutosh
Ghosh, Tiash
Jenamani, Mamata
Singh, Sanjai Kumar
Routray, Aurobinda
INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 378 - 384
[30] Parallel processing for accelerated mean shift algorithm with GPU
Chen, Jia
Wu, Xiaojun
Cai, Rong
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2010, 22 (03): : 461 - 466

← 1 2 3 4 5 →