StreamPIM: Streaming Matrix Computation in Racetrack Memory

被引:3
|
作者
An, Yuda [1 ]
Tang, Yunxiao [1 ]
Yi, Shushu [1 ]
Peng, Li [1 ]
Pan, Xiurui [1 ]
Sun, Guangyu [1 ,3 ]
Luo, Zhaochu [1 ]
Li, Qiao [2 ]
Zhang, Jie [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Xiamen Univ, Xiamen, Peoples R China
[3] Beijing Adv Innovat Ctr Integrated Circuits, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
CIRCUIT; DESIGN; ENERGY; GPUS;
D O I
10.1109/HPCA57654.2024.00031
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Racetrack memory (RM) techniques have become promising solutions to resolve the memory wall issue as they increase memory density, reduce energy consumption and are capable of building processing-in-memory (PIM) architectures. RM can place arithmetic logic units in or near its memory arrays to process tasks offloaded by the host. While there already exist multiple studies of processing in RM, these solutions, unfortunately, suffer from data transfer overheads imposed by the loose coupling of the memory core and the computation units. To address this issue, we propose StreamPIM, a new processing-in-RM architecture, which tightly couples the memory core and the computation units. Specifically, StreamPIM directly constructs a matrix processor from domain-wall nanowires without the usage of CMOS-based computation units. It also designs a domainwall nanowire-based bus, which can eliminate electromagnetic conversion. StreamPIM further optimizes the performance by leveraging RM internal parallelism. Our evaluation results show that StreamPIM achieves 39.1x higher performance and saves 58.4x energy consumption, compared with the traditional computing platform.
引用
收藏
页码:297 / 311
页数:15
相关论文
共 50 条
  • [31] ShiftsReduce: Minimizing Shifts in Racetrack Memory 4.0
    Khan, Asif Ali
    Hameed, Fazal
    Blaesing, Robin
    Parkin, Stuart S. P.
    Castrillon, Jeronimo
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (04)
  • [32] VLIW-Based FPGA Computation Fabric with Streaming Memory Hierarchy for Medical Imaging Applications
    Hoozemans, Joost
    Heij, Rolf
    van Straten, Jeroen
    Al-Ars, Zaid
    APPLIED RECONFIGURABLE COMPUTING, 2017, 10216 : 36 - 43
  • [33] Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming
    Chintapalli, Sanket
    Dagit, Derek
    Evans, Bobby
    Farivar, Reza
    Graves, Thomas
    Holderbaugh, Mark
    Liu, Zhuo
    Nusbaum, Kyle
    Patil, Kishorkumar
    Peng, Boyang Jerry
    Poulosky, Paul
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1789 - 1792
  • [34] Energy Efficient In-memory Integer Multiplication Based on Racetrack Memory
    Luo, Tao
    Zhang, Wei
    He, Bingsheng
    Liu, Cheng
    Maskell, Douglas
    2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 1409 - 1414
  • [35] A Racetrack Memory Based In-memory Booth Multiplier for Cryptography Application
    Luo, Tao
    Zhang, Wei
    He, Bingsheng
    Maskell, Douglas
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 286 - 291
  • [36] Spatial memory streaming
    Somogyi, Stephen
    Wenisch, Thomas F.
    33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHTIECTURE, PROCEEDINGS, 2006, : 252 - 263
  • [37] Streaming Model Computation of the FDTD Problem
    Smyk, Adam
    Tudruj, Marek
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING, PT I, 2012, 7133 : 184 - 192
  • [38] Streaming Complexity of Spanning Tree Computation
    Chang, Yi-Jun
    Farach-Colton, Martin
    Hsu, Tsan-Sheng
    Tsai, Meng-Tsung
    37TH INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2020), 2020, 154
  • [39] Computation of Streaming Current in Oil Pipes
    Wang, Jufen
    Meng, Haolong
    IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2009, 16 (02) : 299 - 304
  • [40] Optimizing Data Layout for Racetrack Memory in Embedded Systems
    Hui, Peng
    Sha, Edwin H. -M.
    Zhuge, Qingfeng
    Xu, Rui
    Wang, Han
    2023 28TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC, 2023, : 110 - 115