A Cache-Aware Approach to Domain Decomposition for Stencil-Based Codes

Authors
Saxena, Gaurav [1]
Jimack, Peter K. [1]
Walkley, Mark A. [1]
Affiliations
[1] Univ Leeds, Sch Comp, Leeds LS2 9JT, W Yorkshire, England
Keywords
PDEs; Domain Decomposition; Stencil; Quasi-cache-directed; Cache-oblivious;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Partial Differential Equations (PDEs) lie at the heart of numerous scientific simulations of physical phenomena. Parallelizing such simulations introduces additional performance penalties in the form of local and global synchronization among cooperating processes. Domain decomposition partitions the largest shareable data structures into sub-domains and attempts to achieve perfect load balance and minimal communication. Until now, research efforts to optimize spatial and temporal cache reuse for stencil-based PDE discretizations (e.g. finite difference and finite element) have considered sub-domain operations only after the domain decomposition has been determined. We derive a cache-oblivious heuristic that minimizes cache misses at the sub-domain level through a quasi-cache-directed analysis, and use it to predict families of high-performance domain decompositions in structured 3-D grids. To the best of our knowledge, this is the first work to optimize domain decompositions by analyzing cache misses, thus connecting single-core parameters (i.e. cache misses) to true multicore parameters (i.e. domain decomposition). We analyze the trade-off between decreasing cache misses through such decompositions and increasing the dynamic bandwidth per core. The limitation of our work is that it is currently applicable only to structured 3-D grids with cuts parallel to the Cartesian axes. We emphasize and conclude that there is an imperative need to rethink domain decompositions in this constantly evolving multicore era.
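To make the notion of a decomposition with axis-parallel cuts concrete, the following is a minimal sketch, not the authors' heuristic. It enumerates every way of splitting a structured 3-D grid among `p` processes as `px * py * pz` blocks and ranks the candidates by a conventional communication proxy (the surface area of one sub-domain). All function names are hypothetical, and the assumption that the last axis is the unit-stride (contiguous) one reflects row-major storage only; the abstract does not specify a memory layout or a cache-miss model.

```python
def factor_triples(p):
    """Yield every ordered triple (px, py, pz) with px * py * pz == p."""
    for px in range(1, p + 1):
        if p % px:
            continue
        q = p // px
        for py in range(1, q + 1):
            if q % py == 0:
                yield px, py, q // py


def subdomain_shape(n, parts):
    """Largest sub-domain extent per axis, using ceiling division."""
    return tuple(-(-ni // pi) for ni, pi in zip(n, parts))


def halo_cells(shape):
    """Surface cells of one sub-domain (one-cell-deep halo).

    This is the classical communication proxy, not a cache-miss count.
    """
    sx, sy, sz = shape
    return 2 * (sx * sy + sy * sz + sx * sz)


def rank_decompositions(n, p):
    """Return (halo_cells, parts, shape, unit_stride_run), smallest halo first.

    unit_stride_run is the sub-domain extent along the last axis, which is
    contiguous in memory under the row-major layout ASSUMED here. Cutting
    that axis shortens contiguous runs, which is one reason the halo
    ranking alone may not predict the fastest decomposition.
    """
    rows = []
    for parts in factor_triples(p):
        shape = subdomain_shape(n, parts)
        rows.append((halo_cells(shape), parts, shape, shape[2]))
    return sorted(rows)


if __name__ == "__main__":
    for halo, parts, shape, run in rank_decompositions((64, 64, 64), 8):
        print(parts, shape, "halo:", halo, "unit-stride run:", run)
```

For a 64 x 64 x 64 grid on 8 processes, the cubic 2 x 2 x 2 split ranks first under this halo proxy. The abstract's point is that a cache-miss analysis can favor a different family of decompositions than the one surface area alone suggests, which is why the sketch also reports the unit-stride run for each candidate.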
Pages: 875-885
Number of pages: 11