Optimization and Performance Modeling of Stencil Computations on ARM Architectures

Cited by: 0
Authors
Zhang, Kaifang [1]
Su, Huayou [1]
Zhang, Peng [2, 3]
Dou, Yong [1]
Affiliations
[1] National University of Defense Technology, National Key Laboratory for Parallel and Distribution Processing, Changsha, China
[2] CAEP Software Center for High Performance Numerical Simulation, Beijing, China
[3] Institute of Applied Physics and Computational Mathematics, Beijing, China
Keywords
ARM architecture - Optimization and performance - Optimization parameter - Performance Model - Performance monitors - Prediction errors - Scientific and engineering applications - Stencil computations
DOI
9408060
Pages: 113 - 121
Related papers
50 in total
  • [1] Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors
    Datta, Kaushik
    Kamil, Shoaib
    Williams, Samuel
    Oliker, Leonid
    Shalf, John
    Yelick, Katherine
    [J]. SIAM REVIEW, 2009, 51 (01) : 129 - 159
  • [2] Modeling Stencil Computations on Modern HPC Architectures
    de la Cruz, Raul
    Araya-Polo, Mauricio
    [J]. HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION, 2015, 8966 : 149 - 171
  • [3] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
    Zhang, Kaifang
    Su, Huayou
    Dou, Yong
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (11) : 13584 - 13600
  • [4] Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures
    Kaifang Zhang
    Huayou Su
    Yong Dou
    [J]. The Journal of Supercomputing, 2021, 77 : 13584 - 13600
  • [5] Unleashing the performance of ccNUMA multiprocessor architectures in heterogeneous stencil computations
    Szustak, Lukasz
    Halbiniak, Kamil
    Wyrzykowski, Roman
    Jakl, Ondrej
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (12) : 7765 - 7777
  • [6] Unleashing the performance of ccNUMA multiprocessor architectures in heterogeneous stencil computations
    Lukasz Szustak
    Kamil Halbiniak
    Roman Wyrzykowski
    Ondřej Jakl
    [J]. The Journal of Supercomputing, 2019, 75 : 7765 - 7777
  • [7] Modeling Optimization of Stencil Computations Via Domain-level Properties
    Nesterenko, Brandon
    Yi, Qing
    Runnels, Brandon
    Lin, Pei-Hung
    Liao, Chunhua
    [J]. PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL WORKSHOP ON PROGRAMMING MODELS AND APPLICATIONS FOR MULTICORES AND MANYCORES (PMAM '22), 2022 : 35 - 44
  • [8] Performance Improvement of Stencil Computations for Multi-core Architectures based on Machine Learning
    Martinez, Victor
    Dupros, Fabrice
    Castro, Marcio
    Navaux, Philippe
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 305 - 314
  • [9] Automatic Performance Tuning of Stencil Computations on GPUs
    Garvey, Joseph D.
    Abdelrahman, Tarek S.
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015 : 300 - 309
  • [10] Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures
    Lin, Pei-Hung
    Yi, Qing
    Quinlan, Daniel
    Liao, Chunhua
    Yan, Yongqing
    [J]. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2016, 2017, 10136 : 137 - 152