An Analysis of Mapping Polybench Kernels to HPC CGRAs

被引:1
|
作者
Weinhardt, Markus [1 ]
机构
[1] Osnabruck Univ Appl Sci, Fac Engn & Comp Sci, Osnabruck, Germany
关键词
Coarse-Grained Reconfigurable Array; High-Performance Computing; Polybench;
D O I
10.1109/IPDPSW55747.2022.00114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a detailed analysis of Mapping the Polybench C 4.2.1 kernels to Coarse-Grain Reconfigurable Arrays (CGRAs), targeting High-Performance Computing (HPC). The results show that the Polybench kernels are well suited for acceleration on a CGRA due to their regular array accesses. However, seperately mapping the innermost loops of the Polybench kernels to a CGRA yields only limited speedups because the small size of the generated dataflow graphs limits the available parallelism and results in a low computational intensity. Therefore, loop transformations which will increase the parallelism and the speedups are suggested. While this work focuses on a specific CGRA and its compiler, the observations and conclusions are also transferable to other CGRAs and their compilers.
引用
收藏
页码:647 / 654
页数:8
相关论文
共 50 条
  • [1] The Cost of Flexibility: Embedded versus Discrete Routers in CGRAs for HPC
    Adhi, Boma
    Cortes, Carlos
    Tan, Yiyu
    Kojima, Takuya
    Podobas, Artur
    Sano, Kentaro
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 347 - 356
  • [2] Exploration of Compute vs. Interconnect Tradeoffs in CGRAs for HPC
    Anderson, Jason
    Adhi, Boma
    Cortes, Carlos
    Del Sozzo, Emanuele
    Ragheb, Omar
    Sano, Kentaro
    [J]. THE PROCEEDINGS OF THE 13TH INTERNATIONAL SYMPOSIUM ON HIGHLY EFFICIENT ACCELERATORS AND RECONFIGURABLE TECHNOLOGIES, HEART 2023, 2023, : 59 - 68
  • [3] Application kernels: HPC resources performance monitoring and variance analysis
    Simakov, Nikolay A.
    White, Joseph P.
    DeLeon, Robert L.
    Ghadersohi, Amin
    Furlani, Thomas R.
    Jones, Matthew D.
    Gallo, Steven M.
    Patra, Abani K.
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17): : 5238 - 5260
  • [4] PathSeeker: A Fast Mapping Algorithm for CGRAs
    Balasubramanian, Mahesh
    Shrivastava, Aviral
    [J]. PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 268 - 273
  • [5] A Scalable Mapping Method for Elastic CGRAs
    Saito, Makoto
    Kojima, Takuya
    Takase, Hideki
    Nakamura, Hiroshi
    [J]. 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 650 - 657
  • [6] Exploration Framework for Synthesizable CGRAs Targeting HPC: Initial Design and Evaluation
    Adhi, Boma
    Cortes, Carlos
    Tan, Yiyu
    Kojima, Takuya
    Podobas, Artur
    Sano, Kentaro
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 639 - 646
  • [7] Graph Minor Approach for Application Mapping on CGRAs
    Chen, Liang
    Mitra, Tulika
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (03)
  • [8] RAMP: Resource-Aware Mapping for CGRAs
    Dave, Shail
    Balasubramanian, Mahesh
    Shrivastava, Aviral
    [J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [9] Evaluating HPC Kernels for Processing in Memory
    Asifuzzaman, Kazi
    Monil, Mohammad Alaul Haque
    Liu, Frank
    Vetter, Jeffrey S.
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2022, 2022,
  • [10] Compiling HPC Kernels for the REDEFINE CGRA
    Madhu, Kavitha T.
    Das, Saptarsi
    Nalesh, S.
    Nandy, S. K.
    Narayan, Ranjani
    [J]. 2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 405 - 410