NAS Parallel Benchmarks with Python']Python: a performance and programming effort analysis focusing on GPUs

被引:1
|
作者
Di Domenico, Daniel [1 ]
Lima, Joao V. F. [2 ]
Cavalheiro, Gerson G. H. [1 ]
机构
[1] Univ Fed Pelotas, Pelotas, RS, Brazil
[2] Univ Fed Santa Maria, Santa Maria, RS, Brazil
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 08期
关键词
NPB; GPU; !text type='Python']Python[!/text; Numba; Programming effort;
D O I
10.1007/s11227-022-04932-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Compiled low-level languages, such as C/C++ and Fortran, have been employed as programming tools to implement applications to explore GPU devices. As a counterpoint to that trend, this paper presents a performance and programming effort analysis with Python, an interpreted and high-level language, which was applied to develop the kernels and applications of NAS Parallel Benchmarks targeting GPUs. We used Numba environment to enable CUDA support in Python, a tool that allows us to implement the GPU programs with pure Python code. Our experimental results showed that Python applications reached a performance similar to C++ programs employing CUDA and better than C++ using OpenACC for most NPB benchmarks. Furthermore, Python codes demanded less operations related to the GPU framework than CUDA, mainly because Python needs a lower number of statements to manage memory allocations and data transfers. Despite that, our Python implementations required more operations than OpenACC ones.
引用
下载
收藏
页码:8890 / 8911
页数:22
相关论文
共 50 条
  • [21] Analysis of Student Misconceptions using Python']Python as an Introductory Programming Language
    Johnson, Fionnuala
    McQuistin, Stephen
    O'Donnell, John
    PROCEEDINGS OF THE 4TH CONFERENCE ON COMPUTING EDUCATION PRACTICE, CEP 2020, 2020,
  • [22] Python']Python Programming Language for Power System Analysis Education and Research
    Fernandes, Thiago R.
    Fernandes, Leonardo R.
    Ricciardi, Tiago R.
    Ugarte, Luis F.
    de Almeida, Madson C.
    PROCEEDINGS OF THE 2018 IEEE PES TRANSMISSION & DISTRIBUTION CONFERENCE AND EXHIBITION - LATIN AMERICA (T&D-LA), 2018,
  • [23] Performance Characterization of Python Runtimes for Multi-device Task Parallel Programming
    William Ruys
    Hochan Lee
    Bozhi You
    Shreya Talati
    Jaeyoung Park
    James Almgren-Bell
    Yineng Yan
    Milinda Fernando
    Mattan Erez
    Milos Gligoric
    Martin Burtscher
    Christopher J. Rossbach
    Keshav Pingali
    George Biros
    International Journal of Parallel Programming, 2025, 53 (2)
  • [24] Analysis Tools for the VyPR Performance Analysis Framework for Python']Python
    Dawes, Joshua Heneage
    Han, Marta
    Reger, Giles
    Franzoni, Giovanni
    Pfeiffer, Andreas
    24TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2019), 2020, 245
  • [25] Scalable Multimedia Content Analysis on Parallel Platforms Using Python']Python
    Gonina, Ekaterina
    Friedland, Gerald
    Battenberg, Eric
    Koanantakool, Penporn
    Driscoll, Michael
    Georganas, Evangelos
    Keutzer, Kurt
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 10 (02)
  • [26] A Comparative Evaluation of Parallel Programming Python']Python Tools for Particle-in-Cell on Symmetric Multiprocessors
    Oscar Blandino, H.
    Meneses, Esteban
    HIGH PERFORMANCE COMPUTING, CARLA 2022, 2022, 1660 : 1 - 15
  • [27] Performance of the NAS parallel benchmarks on grid enabled clusters
    Sokolowski, PJ
    Grosu, D
    THIRD IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS, PROCEEDINGS, 2004, : 356 - 361
  • [28] A Python']Python-based programming language for high-performance computational genomics
    Shajii, Ariya
    Numanagic, Ibrahim
    Leighton, Alexander T.
    Greenyer, Haley
    Amarasinghe, Saman
    Berger, Bonnie
    NATURE BIOTECHNOLOGY, 2021, 39 (09) : 1062 - 1064
  • [29] A Rigorous Benchmarking and Performance Analysis Methodology for Python']Python Workloads
    Crape, Arthur
    Eeckhout, Lieven
    2020 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2020), 2020, : 83 - 93
  • [30] Performance Analysis of Parallel Master-Slave Evolutionary Strategies (μ,λ) Model Python']Python Implementation for CPU and GPGPU
    Zubanovic, D.
    Hidic, A.
    Hajdarevic, A.
    Nosovic, N.
    Konjicija, S.
    2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 1609 - 1613