An Optimization of FMM under CPU plus GPU Heterogeneous Architecture

被引:0
|
作者
Zhu, Yonghua [1 ]
Lu, Xiao [2 ]
机构
[1] Shanghai Univ, Ctr Comp, Room321 Bldg D,99 Shangda Rd, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Sch Engn & Comp Sci, Shanghai 200072, Peoples R China
关键词
GPU; Heterogeneous Architecture; FMM; Threads Mapping Model;
D O I
10.1109/CEC.2012.33
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Heterogeneous architecture of CPU+GPU has been the main trend for high-performance computing/parallel processing in recent years. However, the formulation of scientific algorithms to take advantage of the performance offered by the new architecture requires rethinking core methods. The algorithmic acceleration is achieved with the main part of fast multipole method (FMM) under the heterogeneous architecture. Based on PetFMM, a Two Dimensional Threads Mapping Model (TDTMM) is proposed to lighten the workload per thread on GPU. The presented threads mapping model is able to improve the execution efficiency of hardware acceleration. Experiment results show that the presented models are feasible and effective.
引用
收藏
页码:147 / 150
页数:4
相关论文
共 50 条
  • [1] Heterogeneous CPU plus GPU approaches for HEVC
    Cebrian-Marquez, Gabriel
    Galiano, Vicente
    Migallon, Hector
    Luis Martinez, Jose
    Cuenca, Pedro
    Lopez-Granado, Otoniel
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (03): : 1215 - 1226
  • [2] Heterogeneous CPU plus GPU approaches for HEVC
    Gabriel Cebrián-Márquez
    Vicente Galiano
    Héctor Migallón
    José Luis Martínez
    Pedro Cuenca
    Otoniel López-Granado
    The Journal of Supercomputing, 2019, 75 : 1215 - 1226
  • [3] Performance comparison of CPU and GPU on a discrete heterogeneous architecture
    Thomas, Winnie
    Daruwala, Rohin D.
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 271 - 276
  • [4] Efficient Electro-Thermal Co-analysis on CPU plus GPU Heterogeneous Architecture
    Huang Kun
    Yang Xu
    Zhao Guoxing
    Luo Zuying
    2012 13TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2012, : 364 - 369
  • [5] Performance Optimization for CPU-GPU Heterogeneous Parallel System
    Wang, Yanhua
    Qiao, Jianzhong
    Lin, Shukuan
    Zhao, Tinglei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1259 - 1266
  • [6] Heterogeneous Cache Hierarchy Management for Integrated CPU-GPU Architecture
    Wen, Hao
    Zhang, Wei
    2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019,
  • [7] Optimization and acceleration of flow simulations for CFD on CPU/GPU architecture
    Lei, Jiang
    Li, Da-li
    Zhou, Yun-long
    Liu, Wei
    JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2019, 41 (07)
  • [8] Cluster optimization algorithm based on CPU and GPU hybrid architecture
    Fei Yin
    Feng Shi
    Cluster Computing, 2022, 25 : 2601 - 2611
  • [9] Benchmarking of High Performance Computing Clusters with Heterogeneous CPU/GPU Architecture
    Sukharev, Pavel V.
    Vasilyev, Nikolay P.
    Rovnyagin, Mikhail M.
    Durnov, Maxim A.
    PROCEEDINGS OF THE 2017 IEEE RUSSIA SECTION YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING CONFERENCE (2017 ELCONRUS), 2017, : 574 - 577
  • [10] Cluster optimization algorithm based on CPU and GPU hybrid architecture
    Yin, Fei
    Shi, Feng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (04): : 2601 - 2611