Locality-Conscious Nested-Loops Parallelization

被引:0
|
作者
Parsa, Saeed [1 ]
Hamzei, Mohammad [1 ]
机构
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
关键词
Automatic nested loops parallelization; data locality; loop tiling; OPTIMIZATION;
D O I
10.4218/etrij.13.0113.0266
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To speed up data-intensive programs, two complementary techniques, namely nested loops parallelization and data locality optimization, should be considered. Effective parallelization techniques distribute the computation and necessary data across different processors, whereas data locality places data on the same processor. Therefore, locality and parallelization may demand different loop transformations. As such, an integrated approach that combines these two can generate much better results than each individual approach. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation results in coarse grain parallelism through exploiting the largest possible groups of outer permutable loops in addition to data locality through dependence satisfaction at inner loops. These groups can be further tiled to improve data locality through exploiting data reuse in multiple dimensions.
引用
收藏
页码:124 / 133
页数:10
相关论文
共 50 条
  • [31] Automatic parallelization of canonical loops
    Padovani da Mata, Leonardo Luiz
    Quintao Pereira, Fernando Magno
    Ferreira, Renato
    SCIENCE OF COMPUTER PROGRAMMING, 2013, 78 (08) : 1193 - 1206
  • [32] Towards Automatic Parallelization of "for" loops
    Bhat, Amit G.
    Babu, Meghana N.
    Anala, M. R.
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 136 - 142
  • [33] The Static Parallelization of Loops and Recursions
    Christian Lengauer
    Sergei Gorlatch
    Christoph Herrmann
    The Journal of Supercomputing, 1997, 11 : 333 - 353
  • [34] Automating verification of loops by parallelization
    Gedell, Tobias
    Hahnle, Reiner
    LOGIC FOR PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND REASONING, PROCEEDINGS, 2006, 4246 : 332 - +
  • [35] The static parallelization of loops and recursions
    Lengauer, C
    Gorlatch, S
    Herrmann, CA
    JOURNAL OF SUPERCOMPUTING, 1997, 11 (04): : 333 - 353
  • [36] Intra-Tile Parallelization for Two-Level Perfectly Nested Loops With Non-Uniform Dependences
    Reyhan, Zahra Abdi
    Lotfi, Shahriar
    Isazadeh, Ayaz
    Karimpour, Jaber
    COMPUTER JOURNAL, 2021, 64 (09): : 1358 - 1383
  • [37] Speculative Parallelization of Sequential Loops on Multicores
    Chen Tian
    Min Feng
    Vijay Nagarajan
    Rajiv Gupta
    International Journal of Parallel Programming, 2009, 37 : 508 - 535
  • [38] Thread based OpenMP for nested parallelization
    Blikberg, R
    Sorevik, T
    PARALLEL COMPUTING: SOFTWARE TECHNOLOGY, ALGORITHMS, ARCHITECTURES AND APPLICATIONS, 2004, 13 : 787 - 794
  • [39] Parallelization and aggregation of nested benders decomposition
    Dempster, MAH
    Thompson, RT
    ANNALS OF OPERATIONS RESEARCH, 1998, 81 : 163 - 187
  • [40] Parallelization of recurrent loops with conditional statements
    Shteinberg, BY
    AUTOMATION AND REMOTE CONTROL, 1995, 56 (09) : 1344 - 1350