Hybrid OpenMP-CUDA parallel implementation of a deterministic solver for ultrashort DG-MOSFETs

被引:1
|
作者
Mantas, Jose M. [1 ]
Vecil, Francesco [2 ]
机构
[1] Univ Granada, Dept Lenguajes & Sistemas Informat, Granada, Spain
[2] Univ Clermont Auvergne, Lab Math Blaise Pascal, UFR Math, Clermont Ferrand, France
关键词
Semiconductor nano-device simulation; hybrid OpenMP-CUDA parallelization; quantum-classical dimensional coupling; Boltzmann-Schrodinger-Poisson system; parallelization of numerical algorithms; parallel heterogeneous systems; GPU computing; DOUBLE-GATE MOSFET; SEMICONDUCTOR-DEVICES; SUBBAND MODEL; SOLVING LARGE; SIMULATION; PERFORMANCE; OPTIMIZATION; SYSTEM; SPMV;
D O I
10.1177/1094342019879985
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The simulation of ultrashort two-dimensional double gate metal-oxide semiconductor field-effect transistors and similar semiconductor devices through a deterministic mesoscopic, hence accurate, model can be very useful for the industry: It can provide reference results for macroscopic solvers and properly describe weakly charged zones of the device. For the scope of this work, we use a Boltzmann-Schrodinger-Poisson model. Its drawback is being particularly costly from the computational point of view, and a purely sequential code may take weeks to simulate high voltages. In this article, we develop a hybrid parallel solver for a graphics processing unit (GPU)-based platform. In order to accelerate the simulations, the Boltzmann transport equations are solved on GPU using the CUDA programing model, while the Schrodinger-Poisson block is performed on multicore CPUs using OpenMP. We have adapted the costliest computing phases to the GPU in an efficient manner, achieving high performance and drastically reducing the simulation time. We give details about the parallel-design strategy and show the performance results.
引用
收藏
页码:81 / 102
页数:22
相关论文
共 38 条
  • [1] A parallel deterministic solver for the Schrodinger-Poisson-Boltzmann system in ultra-short DG-MOSFETs: Comparison with Monte-Carlo
    Vecil, Francesco
    Mantas, Jose M.
    Caceres, Maria J.
    Sampedro, Carlos
    Godoy, Andres
    Gamiz, Francisco
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2014, 67 (09) : 1703 - 1721
  • [2] Acceleration of the Finite Element Method Using Hybrid OpenMP-CUDA
    Meng, Huan-Ting
    Nie, Bao-Lin
    Jin, Jian-Ming
    Wong, Steven
    Macon, Charles
    2014 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM (APSURSI), 2014, : 1379 - 1380
  • [3] OpenMP implementation of parallel linear solver for reservoir simulation
    Hu, Changjun
    Zhang, Jilin
    Wang, Jue
    Li, Jianjiang
    PRACTICAL PROGRAMMING MODEL FOR THE MULTI-CORE ERA, PROCEEDINGS, 2008, 4935 : 134 - 137
  • [4] A Dynamically Balanced OpenMP-CUDA Implementation of PDE-Based Contrast Source Inversion for Microwave Imaging
    Geddert, Nicholas
    Jeffrey, Ian
    2018 18TH INTERNATIONAL SYMPOSIUM ON ANTENNA TECHNOLOGY AND APPLIED ELECTROMAGNETICS (ANTEM 2018), 2018,
  • [5] An OpenMP-CUDA Implementation of Multilevel Fast Multipole Algorithm for Electromagnetic Simulation on Multi-GPU Computing Systems
    Guan, Jian
    Yan, Su
    Jin, Jian-Ming
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2013, 61 (07) : 3607 - 3616
  • [6] Parallel Implementation of FEM Solver for Shared Memory Using OpenMP
    Majeed, Rabia
    Farrukh, Ramisha
    Riaz, Omer
    Ali, Sikandar
    Samad, Ali
    Khan, Mukhtaj
    Mathematical Problems in Engineering, 2022, 2022
  • [7] Parallel Implementation of FEM Solver for Shared Memory Using OpenMP
    Majeed, Rabia
    Farrukh, Ramisha
    Riaz, Omer
    Ali, Sikandar
    Samad, Ali
    Khan, Mukhtaj
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [8] Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters
    Yang, Chao-Tung
    Huang, Chih-Lin
    Lin, Cheng-Fang
    COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (01) : 266 - 269
  • [9] Performance Evaluation of Hybrid Parallel Computing for WRF Model with CUDA and OpenMP
    Ridwan, Ridwan
    Kistijantoro, Achmad Imam
    Kudsy, Mahally
    Gunawan, Djoko
    2015 3rd International Conference on Information and Communication Technology (ICoICT), 2015, : 425 - 430
  • [10] Efficient GPU implementation of a Boltzmann-Schrodinger-Poisson solver for the simulation of nanoscale DG MOSFETs
    Vecil, Francesco
    Mantas, Jose Miguel
    Alonso-Jorda, Pedro
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (12): : 13370 - 13401