Performance comparison of designated preprocessing white light interferometry algorithms on emerging multi- and many-core architectures

被引：3

作者：

Schneider, Max ^{[1
]}

Fey, Dietmar ^{[1
]}

Kapusi, Daniel ^{[4
]}

Machleidt, Torsten ^{[2
,3
]}

机构：

[1] Univ Erlangen Nurnberg, Chair Comp Sci Computer Architecture 3, Erlangen, Germany

[2] Tech Univ Ilmenau, Comp Graph Grp, Ilmenau, Germany

[3] GBS mbH, Ilmenau, Germany

[4] ZBS e V, Ilmenau, Germany

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS) | 2011年 / 4卷

关键词：

White light interferometry; Preprocessing Algorithms; OpenMP; IBM Cell BE; CUDA; GPGPU;

D O I：

10.1016/j.procs.2011.04.222

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Parallel computing has been a niche for scientific research in academia for decades. However, as common industrial applications become more and more performance demanding and raising the clock frequency of conventional single-core systems is hardly an option due to reaching technological limitations, efficient use of multi-core CPUs has become imperative. 3D surface analysis of objects using the white light interferometry presents one of such computationally challenging applications. In this article three established preprocessing methods of white light interferometry data analysis are used to evaluate the suitability of three modern multi-core architectures - generic multi-core CPUs, GPGPUs and IBM's Cell BE. The results show that function offloading to GPGPUs, which offer independent memory and many hundreds of threads running in parallel, yields the highest performance compared to other systems. Furthermore, by outsourcing computational tasks to GPUs, the workload of other system resources, such as CPU or system memory, is reduced. This allows accelerated execution of other tasks, e. g. acquisition of images with higher frame rates.

引用

页码：2037 / 2046

页数：10

共 46 条

[41] Performance analysis of a 3D unstructured mesh hydrodynamics code on multi-core and many-core architectures
Waltz, J.
Wohlbier, J. G.
Risinger, L. D.
Canfield, T. R.
Charest, M. R. J.
Long, A. R.
Morgan, N. R.
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2015, 77 (06) : 319 - 333
[42] Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors
Park, Jongsoo
Smelyanskiy, Mikhail
Vaidyanathan, Karthikeyan
Heinecke, Alexander
Kalamkar, Dhiraj D.
Patwary, Md Mosotofa Ali
Pirogov, Vadim
Dubey, Pradeep
Liu, Xing
Rosales, Carlos
Mazauric, Cyril
Daley, Christopher
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (01): : 11 - 27
[43] Extended Performance Analysis of the Time Predictable On-demand Coherent Data Cache for Multi- and Many-core Systems
Pyka, Arthur
Rohde, Mathias
Uhrig, Sascha
2014 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION (SAMOS XIV), 2014, : 107 - 114
[44] A simulation suite for Lattice-Boltzmann based real-time CFD applications exploiting multi-level parallelism on modern multi- and many-core architectures
Geveler, Markus
Ribbrock, Dirk
Mallach, Sven
Goeddeke, Dominik
JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (02) : 113 - 123
[45] Introducing multi-level parallelism, at coarse, fine and instruction level to enhance the performance of iterative solvers for large sparse linear systems on Multi- and Many-core architecture
Gratien, Jean-Marc
PROCEEDINGS OF SIXTH WORKSHOP ON THE LLVM COMPILER INFRASTRUCTURE IN HPC AND WORKSHOP ON HIERARCHICAL PARALLELISM FOR EXASCALE COMPUTING (LLVM-HPC2020 AND HIPAR 2020), 2020, : 85 - 95
[46] Portable multi- and many-core performance for finite-difference or finite-element codes - application to the free-surface component of NEMO (NEMOLite2D 1.0)
Porter, Andrew R.
Appleyard, Jeremy
Ashworth, Mike
Ford, Rupert W.
Holt, Jason
Liu, Hedong
Riley, Graham D.
GEOSCIENTIFIC MODEL DEVELOPMENT, 2018, 11 (08) : 3447 - 3464

← 1 2 3 4 5 →