Region-based parallelization of irregular reductions on explicitly managed memory hierarchies

被引:0
|
作者
Seonggun Kim
Hwansoo Han
Kwang-Moo Choe
机构
[1] KAIST,Department of Computer Science
[2] Sungkyunkwan University,Department of Computer Engineering
来源
关键词
Explicitly managed memory hierarchies; Irregular reduction; Loop parallelization; Compilation techniques;
D O I
暂无
中图分类号
学科分类号
摘要
Multicore architectures are evolving with the promise of extreme performance for the classes of applications that require high performance and large bandwidth of memory. Irregular reduction is one of important computation patterns for many complex scientific applications, and it typically requires high performance and large bandwidth of memory. In this article, we propose region-based parallelization techniques for irregular reductions on multicore architectures with explicitly managed memory hierarchies. Managing memory hierarchy in software requires a lot of programming efforts and tends to be error-prone. The difficulties are even worse for applications with irregular data access patterns. To relieve the burden of memory management from programmers, we develop abstractions, particularly targeted to irregular reduction, for structuring parallel tasks, mapping the parallel tasks to processing units and scheduling data transfers between the memory hierarchies. Our framework employs iteration reordering based on regions of data along with dynamic scheduling of parallel tasks. We experimentally evaluate the effectiveness of our techniques for irregular reduction kernels on the Cell processor embedded in a Sony PlayStation3. Experimental results show the speedups of 8 to 14 on the six available SPEs.
引用
收藏
页码:25 / 55
页数:30
相关论文
共 37 条
  • [31] A Region-based Fault-Tolerant Routing Algorithmfor 2D Irregular Mesh Network-on-Chip
    Yusuke Fukushima
    Masaru Fukushi
    Ikuko Eguchi Yairi
    [J]. Journal of Electronic Testing, 2013, 29 : 415 - 429
  • [32] Region-based Memory Management for GPU Programming Languages Enabling Rich Data Structures on a Spartan Host
    Holk, Eric
    Newton, Ryan
    Siek, Jeremy
    Lumsdaine, Andrew
    [J]. ACM SIGPLAN NOTICES, 2014, 49 (10) : 141 - 155
  • [33] A Constant-Time Region-Based Memory Allocator for Embedded Systems with Unpredictable Length Array Generation
    Sanchez, Jordi
    Benet, Gines
    Simo, Jose E.
    [J]. 2010 IEEE CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2010,
  • [34] Guided Region-Based GPU Scheduling: Utilizing Multi-thread Parallelism to Hide Memory Latency
    Chen, Jianmin
    Tao, Xi
    Yang, Zhen
    Peir, Jih-Kwon
    Li, Xiaoyuan
    Lu, Shih-Lien
    [J]. IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 441 - 451
  • [35] Reversible Object-Oriented Programming with Region-Based Memory Management Work-in-progress Report
    Schultz, Ulrik Pagh
    [J]. REVERSIBLE COMPUTATION, RC 2018, 2018, 11106 : 322 - 328
  • [36] Semi-automatic region-based memory management for real-time Java']Java embedded systems
    Salagnac, Guillaume
    Rippert, Christophe
    Yovine, Sergio
    [J]. 13TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2007, : 73 - +
  • [37] CPA-BAM-BnB: Block-Abstraction Memoization and Region-Based Memory Models for Predicate Abstractions
    Andrianov, Pavel
    Friedberger, Karlheinz
    Mandrykin, Mikhail
    Mutilin, Vadim
    Volkov, Anton
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, TACAS 2017, PT II, 2017, 10206 : 355 - 359