Enabling Multi-threaded Applications on Hybrid Shared Memory Manycore Architectures

Cited by: 0
Authors
Rawat, Tushar [1]
Shrivastava, Aviral [1]
Affiliations
[1] Arizona State Univ, Comp Microarchitecture Lab, Tempe, AZ 85281 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
As the number of cores per chip increases, maintaining cache coherence becomes prohibitive in both power and performance. Non-Coherent Cache (NCC) architectures do away with hardware-based cache coherence, but are difficult to program. Some existing architectures offer a middle ground by providing some shared memory in hardware. Specifically, the 48-core Intel Single-chip Cloud Computer (SCC) provides some off-chip (DRAM) shared memory and some on-chip (SRAM) shared memory. We call such architectures Hybrid Shared Memory (HSM) manycore architectures. However, how to efficiently execute multi-threaded programs on HSM architectures is an open problem. To execute a multi-threaded program correctly on HSM architectures, the compiler must: i) identify all the shared data and map it to the shared memory, and ii) map the frequently accessed shared data to the on-chip shared memory. In this paper, we present a source-to-source translator written using CETUS (Dave et al. [1]) that identifies a conservative superset of all the shared data in a multi-threaded application and maps it to the off-chip shared memory, which enables execution on HSM architectures. This improves the performance of our benchmarks by 32x. Next, we identify the frequently accessed shared data and map it to the on-chip shared memory. This further improves the performance of our benchmarks by 8x on average.
Pages: 742-747
Page count: 6