DHTS: A Dynamic Hybrid Tiling Strategy for Optimizing Stencil Computation on GPUs

被引:0
|
作者
Liu, Song [1 ]
Zhang, Zengyuan [1 ]
Wu, Weiguo [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Stencil computation; dynamic hybrid tiling; performance;
D O I
10.1109/TC.2023.3271060
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Stencil computation is an important class of computational modes in scientific computing applications. Loop tiling techniques have been widely studied to accelerate stencil computations on different architectures by exploiting parallelism and data locality. Recent advanced tiling methods enable the tile-wise concurrent start-up to improve the execution performance. However, such methods statically partition all dimensions of iteration space into tiles with predetermined complex shapes and sizes, and thus lead to low thread utilization and memory access efficiency on GPUs. In this paper, we present DHTS, a novel dynamic hybrid tiling strategy for stencil computations. DHTS employs static tiling on the outer dimensions to achieve concurrent start-up parallelism, while proposes a dynamic rectangular tiling method on the inner dimensions to improve thread utilization and memory access efficiency. By deriving tile size constraints, DHTS adaptively achieves equal-size workload of tiles, and therefore reducing idle threads and increasing coalesced memory accesses within tiles. We implement the proposed strategy with different complex tile shapes. Experimental results on Titan V and Tesla V100 GPUs show that DHTS effectively improves the execution performance of 2D/3D stencils compared to state-of-the-art tiling methods, and achieves the best improvement of 28x.
引用
收藏
页码:2795 / 2807
页数:13
相关论文
共 50 条
  • [21] Energy-efficient Stencil Computations on Distributed GPUs using Dynamic Parallelism and GPU-controlled Communication
    Oden, Lena
    Klenk, Benjamin
    Froening, Holger
    2014 ENERGY EFFICIENT SUPERCOMPUTING WORKSHOP (E2SC), 2014, : 31 - 40
  • [22] Multi-level spatial and temporal tiling for efficient HPC stencil computation on many-core processors with large shared caches
    Yount, Charles
    Duran, Alejandro
    Tobin, Josh
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 92 : 903 - 919
  • [23] Effective Use of Large High-Bandwidth Memory Caches in HPC Stencil Computation via Temporal Wave-Front Tiling
    Yount, Charles
    Duran, Alejandro
    PROCEEDINGS OF PMBS 2016: 7TH INTERNATIONAL WORKSHOP ON PERFORMANCE MODELING, BENCHMARKING AND SIMULATION OF HIGH PERFORMANCE COMPUTING SYSTEMS, 2016, : 65 - 75
  • [24] Dynamic strategy based parallel ant colony optimization on GPUs for TSPs
    Zhou, Yi
    He, Fazhi
    Qiu, Yimin
    SCIENCE CHINA-INFORMATION SCIENCES, 2017, 60 (06)
  • [25] Dynamic strategy based parallel ant colony optimization on GPUs for TSPs
    Yi ZHOU
    Fazhi HE
    Yimin QIU
    ScienceChina(InformationSciences), 2017, 60 (06) : 260 - 262
  • [26] Dynamic strategy based parallel ant colony optimization on GPUs for TSPs
    Yi Zhou
    Fazhi He
    Yimin Qiu
    Science China Information Sciences, 2017, 60
  • [27] Optimizing Energy Consumption and Parallel Performance for Static and Dynamic Betweenness Centrality using GPUs
    McLaughlin, Adam
    Riedy, Jason
    Bader, David A.
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,
  • [28] Wheel: Accelerating CNNs with Distributed GPUs via Hybrid Parallelism and Alternate Strategy
    Du, Xiaoyu
    Tang, Jinhui
    Li, Zechao
    Qin, Zhiguang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 393 - 401
  • [29] Portfolio Strategy Optimizing Model for Risk Management Utilizing Evolutionary Computation
    Matsumura, Koki
    Kakinoki, Hidefumi
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2014, 97 (08) : 45 - 62
  • [30] A Routing Strategy with Optimizing Linear Programming in Hybrid SDN
    Wang, Chenhui
    Ni, Hong
    Liu, Lei
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2022, E105B (05) : 569 - 579