DHTS: A Dynamic Hybrid Tiling Strategy for Optimizing Stencil Computation on GPUs

Authors
Liu, Song [1]
Zhang, Zengyuan [1]
Wu, Weiguo [1]
Affiliations
[1] Xi'an Jiaotong University, School of Computer Science and Technology, Xi'an 710049, Shaanxi, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Stencil computation; dynamic hybrid tiling; performance
DOI
10.1109/TC.2023.3271060
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Stencil computation is an important class of computational patterns in scientific computing applications. Loop tiling techniques have been widely studied to accelerate stencil computations on different architectures by exploiting parallelism and data locality. Recent advanced tiling methods enable tile-wise concurrent start-up to improve execution performance. However, such methods statically partition all dimensions of the iteration space into tiles with predetermined complex shapes and sizes, which leads to low thread utilization and poor memory access efficiency on GPUs. In this paper, we present DHTS, a novel dynamic hybrid tiling strategy for stencil computations. DHTS employs static tiling on the outer dimensions to achieve concurrent start-up parallelism, and applies a dynamic rectangular tiling method on the inner dimensions to improve thread utilization and memory access efficiency. By deriving tile size constraints, DHTS adaptively equalizes the workload of tiles, thereby reducing idle threads and increasing coalesced memory accesses within tiles. We implement the proposed strategy with different complex tile shapes. Experimental results on Titan V and Tesla V100 GPUs show that DHTS effectively improves the execution performance of 2D/3D stencils compared to state-of-the-art tiling methods, with a best-case improvement of 28x.
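For readers unfamiliar with the terminology, the following is a minimal sketch of rectangular tiling applied to a single sweep of a 2D 5-point Jacobi stencil. It is not the DHTS algorithm: it runs on the CPU, tiles only the spatial dimensions within one time step, and does not model concurrent start-up, dynamic tile sizing, or GPU thread mapping. The function names (`cell_update`, `rect_tiles`, `jacobi_step`, `jacobi_step_tiled`), the 0.2 averaging weight, and the tile sizes are all illustrative assumptions.

```python
def cell_update(grid, i, j):
    """5-point stencil: average a cell with its four neighbours (assumed weights)."""
    return 0.2 * (grid[i][j] + grid[i - 1][j] + grid[i + 1][j]
                  + grid[i][j - 1] + grid[i][j + 1])


def rect_tiles(lo, hi, tile):
    """Split the half-open range [lo, hi) into tiles of width `tile`.

    Every tile has the same width except possibly the last one.
    """
    return [(start, min(start + tile, hi)) for start in range(lo, hi, tile)]


def jacobi_step(grid):
    """One untiled sweep; boundary cells are copied unchanged."""
    n, m = len(grid), len(grid[0])
    out = [row[:] for row in grid]
    for i in range(1, n - 1):
        for j in range(1, m - 1):
            out[i][j] = cell_update(grid, i, j)
    return out


def jacobi_step_tiled(grid, tile_i, tile_j):
    """The same sweep, visiting interior cells one rectangular tile at a time."""
    n, m = len(grid), len(grid[0])
    out = [row[:] for row in grid]
    for i0, i1 in rect_tiles(1, n - 1, tile_i):
        for j0, j1 in rect_tiles(1, m - 1, tile_j):
            # All cells in a tile read only from `grid`, so tiles are
            # independent within a single time step.
            for i in range(i0, i1):
                for j in range(j0, j1):
                    out[i][j] = cell_update(grid, i, j)
    return out
```

Because every tile reads only the previous time step, the tiled sweep produces the same values as the untiled one; the tile width only changes the order in which cells are visited and how evenly the work divides among tiles.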
Pages: 2795-2807
Page count: 13