Hybrid OpenMP-CUDA parallel implementation of a deterministic solver for ultrashort DG-MOSFETs

Cited by: 1
Authors
Mantas, Jose M. [1]
Vecil, Francesco [2]
Affiliations
[1] Univ Granada, Dept Lenguajes & Sistemas Informat, Granada, Spain
[2] Univ Clermont Auvergne, Lab Math Blaise Pascal, UFR Math, Clermont Ferrand, France
Keywords
Semiconductor nano-device simulation; hybrid OpenMP-CUDA parallelization; quantum-classical dimensional coupling; Boltzmann-Schrödinger-Poisson system; parallelization of numerical algorithms; parallel heterogeneous systems; GPU computing; DOUBLE-GATE MOSFET; SEMICONDUCTOR-DEVICES; SUBBAND MODEL; SOLVING LARGE; SIMULATION; PERFORMANCE; OPTIMIZATION; SYSTEM; SPMV;
DOI
10.1177/1094342019879985
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
The simulation of ultrashort two-dimensional double-gate metal-oxide-semiconductor field-effect transistors (DG-MOSFETs) and similar semiconductor devices through a deterministic mesoscopic, hence accurate, model can be very useful for the industry: it can provide reference results for macroscopic solvers and properly describe weakly charged zones of the device. In this work, we use a Boltzmann-Schrödinger-Poisson model. Its drawback is its high computational cost: a purely sequential code may take weeks to simulate high voltages. In this article, we develop a hybrid parallel solver for a graphics processing unit (GPU)-based platform. To accelerate the simulations, the Boltzmann transport equations are solved on the GPU using the CUDA programming model, while the Schrödinger-Poisson block is solved on multicore CPUs using OpenMP. We have adapted the costliest computing phases to the GPU in an efficient manner, achieving high performance and drastically reducing the simulation time. We give details about the parallel-design strategy and show the performance results.
Pages: 81-102
Number of pages: 22