NAS Parallel Benchmarks with Python']Python: a performance and programming effort analysis focusing on GPUs

被引:1
|
作者
Di Domenico, Daniel [1 ]
Lima, Joao V. F. [2 ]
Cavalheiro, Gerson G. H. [1 ]
机构
[1] Univ Fed Pelotas, Pelotas, RS, Brazil
[2] Univ Fed Santa Maria, Santa Maria, RS, Brazil
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 08期
关键词
NPB; GPU; !text type='Python']Python[!/text; Numba; Programming effort;
D O I
10.1007/s11227-022-04932-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Compiled low-level languages, such as C/C++ and Fortran, have been employed as programming tools to implement applications to explore GPU devices. As a counterpoint to that trend, this paper presents a performance and programming effort analysis with Python, an interpreted and high-level language, which was applied to develop the kernels and applications of NAS Parallel Benchmarks targeting GPUs. We used Numba environment to enable CUDA support in Python, a tool that allows us to implement the GPU programs with pure Python code. Our experimental results showed that Python applications reached a performance similar to C++ programs employing CUDA and better than C++ using OpenACC for most NPB benchmarks. Furthermore, Python codes demanded less operations related to the GPU framework than CUDA, mainly because Python needs a lower number of statements to manage memory allocations and data transfers. Despite that, our Python implementations required more operations than OpenACC ones.
引用
收藏
页码:8890 / 8911
页数:22
相关论文
共 50 条
  • [31] OMB-Py: Python']Python Micro-Benchmarks for Evaluating Performance of MPI Libraries on HPC Systems
    Alnaasan, Nawras
    Jain, Arpan
    Shafi, Aamir
    Subramoni, Hari
    Panda, Dhabaleswar K.
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 870 - 879
  • [32] Investigating Read Performance of Python']Python and NetCDF When Using HPC Parallel Filesystems
    Jones, Matthew
    Blower, Jon
    Lawrence, Bryan
    Osprey, Annette
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2016 INTERNATIONAL WORKSHOPS, 2016, 9945 : 153 - 168
  • [33] Statistical analysis of NAS parallel benchmarks and LINPACK results
    Simon, HD
    Strohmaier, E
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1995, 919 : 626 - 633
  • [34] Using NAS Parallel Benchmarks to Evaluate HPC Performance in Clouds
    Okada, Thiago Kenji
    Goldman, Alfredo
    Cavalheiro, Gerson Geraldo H.
    15TH IEEE INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (IEEE NCA 2016), 2016, : 27 - 30
  • [35] A Spanish Chemoinformatics GitBook for Chemical Data Retrieval and Analysis Using Python']Python Programming
    Saldivar-Gonzalez, Fernanda I.
    Prado-Romero, Diana L.
    Cedillo-Gonzalez, Raziel
    Chavez-Hernandez, Ana L.
    Avellaneda-Tamayo, Juan F.
    Gomez-Garcia, Alejandro
    Juarez-Rivera, Luis
    Medina-Franco, Jose L.
    JOURNAL OF CHEMICAL EDUCATION, 2024, 101 (06) : 2549 - 2554
  • [36] Analysis of Cartographic Generalization based on PYTHON']PYTHON Programming Language on Digital Topographic Maps
    Stojanovic, Marko
    Drobnjak, Sinisa
    Jovanovic, Jasmina M.
    Galjak, Nenad
    Vucicevic, Ana
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON GEOGRAPHICAL INFORMATION SYSTEMS THEORY, APPLICATIONS AND MANAGEMENT (GISTAM), 2020, : 191 - 198
  • [37] Performance characteristics of the multi-zone NAS parallel benchmarks
    Jin, HQ
    Van der Wijngaart, RF
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2006, 66 (05) : 674 - 685
  • [38] Performance Evaluation of NAS Parallel Benchmarks on Intel® Xeon Phi™
    Ramachandran, Arunmoezhi
    Vienne, Jerome
    Van der Wijngaart, Rob
    Koesterke, Lars
    Sharapov, Ilya
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 736 - 743
  • [39] GPU-aware Communication with UCX in Parallel Programming Models: Charm plus plus , MPI, and Python']Python
    Choi, Jaemin
    Fink, Zane
    White, Sam
    Bhat, Nitin
    Richards, David F.
    Kale, Laxmikant, V
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 479 - 488
  • [40] PERFORMANCE OF THE NAS PARALLEL BENCHMARKS ON PVM-BASED NETWORKS
    WHITE, S
    ALUND, A
    SUNDERAM, VS
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1995, 26 (01) : 61 - 71