StreamPIM: Streaming Matrix Computation in Racetrack Memory

被引:3
|
作者
An, Yuda [1 ]
Tang, Yunxiao [1 ]
Yi, Shushu [1 ]
Peng, Li [1 ]
Pan, Xiurui [1 ]
Sun, Guangyu [1 ,3 ]
Luo, Zhaochu [1 ]
Li, Qiao [2 ]
Zhang, Jie [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Xiamen Univ, Xiamen, Peoples R China
[3] Beijing Adv Innovat Ctr Integrated Circuits, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
CIRCUIT; DESIGN; ENERGY; GPUS;
D O I
10.1109/HPCA57654.2024.00031
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Racetrack memory (RM) techniques have become promising solutions to resolve the memory wall issue as they increase memory density, reduce energy consumption and are capable of building processing-in-memory (PIM) architectures. RM can place arithmetic logic units in or near its memory arrays to process tasks offloaded by the host. While there already exist multiple studies of processing in RM, these solutions, unfortunately, suffer from data transfer overheads imposed by the loose coupling of the memory core and the computation units. To address this issue, we propose StreamPIM, a new processing-in-RM architecture, which tightly couples the memory core and the computation units. Specifically, StreamPIM directly constructs a matrix processor from domain-wall nanowires without the usage of CMOS-based computation units. It also designs a domainwall nanowire-based bus, which can eliminate electromagnetic conversion. StreamPIM further optimizes the performance by leveraging RM internal parallelism. Our evaluation results show that StreamPIM achieves 39.1x higher performance and saves 58.4x energy consumption, compared with the traditional computing platform.
引用
收藏
页码:297 / 311
页数:15
相关论文
共 50 条
  • [21] Exploring Main Memory Design Based on Racetrack Memory Technology
    Hu, Qingda
    Sung, Guangyu
    Shu, Jiwu
    Zhang, Chao
    2016 INTERNATIONAL GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI), 2016, : 397 - 402
  • [22] Streaming computation of combinatorial objects
    Bar-Yossef, Z
    Reingold, O
    Shaltiel, R
    Trevisan, L
    17TH ANNUAL IEEE CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2002, : 165 - 174
  • [23] Streaming computation of Delaunay triangulations
    Isenburg, Martin
    Liu, Yuanxin
    Shewchuk, Jonathan
    Snoeyink, Jack
    ACM TRANSACTIONS ON GRAPHICS, 2006, 25 (03): : 1049 - 1056
  • [24] Connected-Skyrmions-Based Racetrack Memory
    Nishitani, Taichi
    Honda, Syuta
    Itoh, Hiroyoshi
    2023 IEEE INTERNATIONAL MAGNETIC CONFERENCE, INTERMAG, 2023,
  • [25] Complementary Skyrmion Racetrack Memory With Voltage Manipulation
    Kang, Wang
    Zheng, Chentian
    Huang, Yangqi
    Zhang, Xichao
    Zhou, Yan
    Lv, Weifeng
    Zhao, Weisheng
    IEEE ELECTRON DEVICE LETTERS, 2016, 37 (07) : 924 - 927
  • [26] Magnetic domain-wall racetrack memory
    Parkin, Stuart S. P.
    Hayashi, Masamitsu
    Thomas, Luc
    SCIENCE, 2008, 320 (5873) : 190 - 194
  • [27] For a really great memory, spend a day at the racetrack
    Marks, Paul
    NEW SCIENTIST, 2008, 198 (2652) : 20 - 21
  • [28] Skyrmions-based magnetic racetrack memory
    Liang Xue
    Zhao Li
    Qiu Lei
    Li Shuang
    Ding Li-Hong
    Feng You-Hua
    Zhang Xi-Chao
    Zhou Yan
    Zhao Guo-Ping
    ACTA PHYSICA SINICA, 2018, 67 (13)
  • [29] Design and Data Management for Magnetic Racetrack Memory
    Li, Bing
    Chen, Fan
    Kang, Wang
    Zhao, Weisheng
    Chen, Yiran
    Li, Hai
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [30] Correcting Multiple Deletions and Insertions in Racetrack Memory
    Sima, Jin
    Bruck, Jehoshua
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (09) : 5619 - 5639