Scalable Fast Multipole Method for Large-Scale Electromagnetic Scattering Problems on Heterogeneous CPU-GPU Clusters

被引:4
|
作者
Vinh Dang [1 ]
Tran, Nghia [1 ]
Kilic, Ozlem [1 ]
机构
[1] Catholic Univ Amer, Dept Elect Engn & Comp Sci, Washington, DC 20064 USA
关键词
Fast multipole method; graphics processing unit; heterogeneous clusters; message passing interface (MPI); portable operating system interface (POSIX); ALGORITHM;
D O I
10.1109/LAWP.2016.2537779
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter investigates a hybrid framework for the solution of large-scale electromagnetic scattering problems by using the fast multipole method (FMM) on a heterogeneous CPU-GPU system. Enabling the use of both CPU and GPU resources available in the cluster allows for solving significantly larger problem sizes than using only CPU or GPU resources. The performance is evaluated on a 13-node cluster equipped with NVIDIA Tesla M2090 GPUs. The experimental results demonstrate that our FMM implementation on CPU-GPU is up to 72.3x faster than that of the 12-core eight-node CPU implementation. The scalability of the CPU-GPU implementation is very close to the theoretical expectations.
引用
下载
收藏
页码:1807 / 1810
页数:4
相关论文
共 50 条
  • [1] Parallelizing Fast Multipole Method for Large-Scale Electromagnetic Problems Using GPU Clusters
    Nguyen, Quang M.
    Vinh Dang
    Kilic, Ozlem
    El-Araby, Esam
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2013, 12 : 868 - 871
  • [2] Fast Multipole Method for Large-Scale Electromagnetic Scattering Problems on GPU Cluster and FPGA-Accelerated Platforms
    Dang, V.
    Nguyen, Q.
    Kilic, O.
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2013, 28 (12): : 1187 - 1198
  • [3] Multilevel Fast Multipole Algorithm for Dielectric Large-scale Electromagnetics Scattering Problems using GPU clusters
    Nghia Tran
    Kilic, Ozlem
    2016 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM, 2016, : 755 - 756
  • [4] Revisiting Linpack Algorithm on Large-scale CPU-GPU Heterogeneous Systems
    Shui, Chaoyang
    Yu, Xianzhi
    Yan, Yujin
    Wang, Yinshan
    Meng, Ke
    Tan, Guangming
    PROCEEDINGS OF THE 25TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '20), 2020, : 411 - 412
  • [5] Parallelizing Multilevel Fast Multipole Algorithm for Large-Scale Electromagnetic Problem on GPU clusters
    Nghia Tran
    Tuan Phan
    Kilic, Ozlem
    2016 IEEE/ACES INTERNATIONAL CONFERENCE ON WIRELESS INFORMATION TECHNOLOGY AND SYSTEMS (ICWITS) AND APPLIED COMPUTATIONAL ELECTROMAGNETICS (ACES), 2016,
  • [6] A heterogeneous parallel implementation of the Markov clustering algorithm for large-scale biological networks on distributed CPU-GPU clusters
    Fu, You
    Zhou, Wei
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (07): : 9017 - 9037
  • [7] A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters
    Shi, Rong
    Potluri, Sreeram
    Hamidouche, Khald
    Lu, Xiaoyi
    Tomko, Karen
    Panda, Dhabaleswar K.
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [8] Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems
    Kreutzer, Moritz
    Hager, Georg
    Wellein, Gerhard
    Pieper, Andreas
    Alvermann, Andreas
    Fehske, Holger
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 417 - 426
  • [9] MAPPING THE SBR AND TW-ILDCs TO HETEROGENEOUS CPU-GPU ARCHITECTURE FOR FAST COMPUTATION OF ELECTROMAGNETIC SCATTERING
    Gao, P. C.
    Tao, Y. B.
    Bai, Z. H.
    Lin, H.
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2012, 122 : 137 - 154
  • [10] Optimizing the LINPACK Algorithm for Large-Scale PCIe-Based CPU-GPU Heterogeneous Systems
    Tan, Guangming
    Shui, Chaoyang
    Wang, Yinshan
    Yu, Xianzhi
    Yan, Yujin
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (09) : 2367 - 2380