An approach of performance comparisons with OpenMP and CUDA parallel programming on multicore systems

被引:3
|
作者
Chang, Chih-Hung [1 ]
Lu, Chih-Wei [1 ]
Yang, Chao-Tung [2 ]
Chang, Tzu-Chieh [2 ]
机构
[1] Hsiuping Univ Sci & Technol, Dept Informat Management, Taichung, Taiwan
[2] Tunghai Univ, Dept Comp Sci, Taichung, Taiwan
来源
关键词
auto-parallel; parallel programming; multicore; OpenMP; CUDA;
D O I
10.1002/cpe.3829
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the past, the tenacious semiconductor problems of operating temperature and power consumption limited the performance growth for single-core microprocessors. Microprocessor vendors hence adopt the multicore chip organizations with parallel processing because the new technology promises faster and lower power needed. In a short time, this trend floods first the development of CPU, then also the other peripherals like GPU. Modern GPUs are very efficient in manipulating computer graphics, and their highly parallel structure makes them even more effective than general-purpose CPUs for a range of graphical complex algorithms. However, technology of multicore processor brought revolution and unavoidable collision to the programming personnel. Multicore processor has high performance; however, parallel processing brings not only the opportunity but also a challenge. The issue of efficiency and the way how programmer or compiler parallelizes the software explicitly are the keys that enhance the performance on multicore chip. In this paper, we propose a parallel programming approach using hybrid CUDA, OpenMP, and MPI programming. There would be two verificational experiments presented in the paper. In the first, we would verify the availability and correctness of the auto-parallel tools, and discuss the performance issues on CPU, GPU, and embedded system. In the second, we would verify how the hybrid programming could surely improve performance. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:4230 / 4245
页数:16
相关论文
共 50 条
  • [1] Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters
    Yang, Chao-Tung
    Huang, Chih-Lin
    Lin, Cheng-Fang
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (01) : 266 - 269
  • [2] Teaching High-performance Computing Systems - A Case Study with Parallel Programming APIs: MPI, OpenMP and CUDA
    Czarnul, Pawel
    Matuszek, Mariusz
    Krzywaniak, Adam
    [J]. COMPUTATIONAL SCIENCE, ICCS 2024, PT VII, 2024, 14838 : 398 - 412
  • [3] Enhancing OpenMP and Its Implementation for Programming Multicore Systems
    Chapman, Barbara
    Huang, Lei
    [J]. PARALLEL COMPUTING: ARCHITECTURES, ALGORITHMS AND APPLICATIONS, 2008, 15 : 3 - +
  • [4] Research on OpenMP Model of the Parallel Programming Technology for Homogeneous Multicore DSP
    Wu, Minjie
    Wu, Weiwei
    Tai, Ning
    Zhao, Hongyu
    Fan, Jiawu
    Yuan, Naichang
    [J]. 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 921 - 924
  • [5] Performance Analysis of Alternative Open Source Parallel Computing Approach to OpenMP on Multicore Processors
    Chickerur, Satyadhyan
    Rayudu, Dadi Mohan Krishna
    Hiriyannaiah, Srinidhi
    Shabalina, Olga
    [J]. KNOWLEDGE-BASED SOFTWARE ENGINEERING, JCKBSE 2014, 2014, 466 : 466 - 476
  • [6] Real-time parallel image processing applications on multicore CPUs with OpenMP and GPGPU with CUDA
    Aydin, Semra
    Samet, Refik
    Bay, Omer Faruk
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (06): : 2255 - 2275
  • [7] Real-time parallel image processing applications on multicore CPUs with OpenMP and GPGPU with CUDA
    Semra Aydin
    Refik Samet
    Omer Faruk Bay
    [J]. The Journal of Supercomputing, 2018, 74 : 2255 - 2275
  • [8] Performance Evaluation of Hybrid Parallel Computing for WRF Model with CUDA and OpenMP
    Ridwan, Ridwan
    Kistijantoro, Achmad Imam
    Kudsy, Mahally
    Gunawan, Djoko
    [J]. 2015 3rd International Conference on Information and Communication Technology (ICoICT), 2015, : 425 - 430
  • [9] Parallel LDPC decoding using CUDA and OpenMP
    Joo-Yul Park
    Ki-Seok Chung
    [J]. EURASIP Journal on Wireless Communications and Networking, 2011
  • [10] Parallel LDPC decoding using CUDA and OpenMP
    Park, Joo-Yul
    Chung, Ki-Seok
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2011,