Performance comparison of designated preprocessing white light interferometry algorithms on emerging multi- and many-core architectures

被引:3
|
作者
Schneider, Max [1 ]
Fey, Dietmar [1 ]
Kapusi, Daniel [4 ]
Machleidt, Torsten [2 ,3 ]
机构
[1] Univ Erlangen Nurnberg, Chair Comp Sci Computer Architecture 3, Erlangen, Germany
[2] Tech Univ Ilmenau, Comp Graph Grp, Ilmenau, Germany
[3] GBS mbH, Ilmenau, Germany
[4] ZBS e V, Ilmenau, Germany
关键词
White light interferometry; Preprocessing Algorithms; OpenMP; IBM Cell BE; CUDA; GPGPU;
D O I
10.1016/j.procs.2011.04.222
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel computing has been a niche for scientific research in academia for decades. However, as common industrial applications become more and more performance demanding and raising the clock frequency of conventional single-core systems is hardly an option due to reaching technological limitations, efficient use of multi-core CPUs has become imperative. 3D surface analysis of objects using the white light interferometry presents one of such computationally challenging applications. In this article three established preprocessing methods of white light interferometry data analysis are used to evaluate the suitability of three modern multi-core architectures - generic multi-core CPUs, GPGPUs and IBM's Cell BE. The results show that function offloading to GPGPUs, which offer independent memory and many hundreds of threads running in parallel, yields the highest performance compared to other systems. Furthermore, by outsourcing computational tasks to GPUs, the workload of other system resources, such as CPU or system memory, is reduced. This allows accelerated execution of other tasks, e. g. acquisition of images with higher frame rates.
引用
收藏
页码:2037 / 2046
页数:10
相关论文
共 46 条
  • [41] Performance analysis of a 3D unstructured mesh hydrodynamics code on multi-core and many-core architectures
    Waltz, J.
    Wohlbier, J. G.
    Risinger, L. D.
    Canfield, T. R.
    Charest, M. R. J.
    Long, A. R.
    Morgan, N. R.
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2015, 77 (06) : 319 - 333
  • [42] Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors
    Park, Jongsoo
    Smelyanskiy, Mikhail
    Vaidyanathan, Karthikeyan
    Heinecke, Alexander
    Kalamkar, Dhiraj D.
    Patwary, Md Mosotofa Ali
    Pirogov, Vadim
    Dubey, Pradeep
    Liu, Xing
    Rosales, Carlos
    Mazauric, Cyril
    Daley, Christopher
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (01): : 11 - 27
  • [43] Extended Performance Analysis of the Time Predictable On-demand Coherent Data Cache for Multi- and Many-core Systems
    Pyka, Arthur
    Rohde, Mathias
    Uhrig, Sascha
    2014 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION (SAMOS XIV), 2014, : 107 - 114
  • [44] A simulation suite for Lattice-Boltzmann based real-time CFD applications exploiting multi-level parallelism on modern multi- and many-core architectures
    Geveler, Markus
    Ribbrock, Dirk
    Mallach, Sven
    Goeddeke, Dominik
    JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (02) : 113 - 123
  • [45] Introducing multi-level parallelism, at coarse, fine and instruction level to enhance the performance of iterative solvers for large sparse linear systems on Multi- and Many-core architecture
    Gratien, Jean-Marc
    PROCEEDINGS OF SIXTH WORKSHOP ON THE LLVM COMPILER INFRASTRUCTURE IN HPC AND WORKSHOP ON HIERARCHICAL PARALLELISM FOR EXASCALE COMPUTING (LLVM-HPC2020 AND HIPAR 2020), 2020, : 85 - 95
  • [46] Portable multi- and many-core performance for finite-difference or finite-element codes - application to the free-surface component of NEMO (NEMOLite2D 1.0)
    Porter, Andrew R.
    Appleyard, Jeremy
    Ashworth, Mike
    Ford, Rupert W.
    Holt, Jason
    Liu, Hedong
    Riley, Graham D.
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2018, 11 (08) : 3447 - 3464