StreamPIM: Streaming Matrix Computation in Racetrack Memory

被引:3
|
作者
An, Yuda [1 ]
Tang, Yunxiao [1 ]
Yi, Shushu [1 ]
Peng, Li [1 ]
Pan, Xiurui [1 ]
Sun, Guangyu [1 ,3 ]
Luo, Zhaochu [1 ]
Li, Qiao [2 ]
Zhang, Jie [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Xiamen Univ, Xiamen, Peoples R China
[3] Beijing Adv Innovat Ctr Integrated Circuits, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
CIRCUIT; DESIGN; ENERGY; GPUS;
D O I
10.1109/HPCA57654.2024.00031
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Racetrack memory (RM) techniques have become promising solutions to resolve the memory wall issue as they increase memory density, reduce energy consumption and are capable of building processing-in-memory (PIM) architectures. RM can place arithmetic logic units in or near its memory arrays to process tasks offloaded by the host. While there already exist multiple studies of processing in RM, these solutions, unfortunately, suffer from data transfer overheads imposed by the loose coupling of the memory core and the computation units. To address this issue, we propose StreamPIM, a new processing-in-RM architecture, which tightly couples the memory core and the computation units. Specifically, StreamPIM directly constructs a matrix processor from domain-wall nanowires without the usage of CMOS-based computation units. It also designs a domainwall nanowire-based bus, which can eliminate electromagnetic conversion. StreamPIM further optimizes the performance by leveraging RM internal parallelism. Our evaluation results show that StreamPIM achieves 39.1x higher performance and saves 58.4x energy consumption, compared with the traditional computing platform.
引用
收藏
页码:297 / 311
页数:15
相关论文
共 50 条
  • [1] Parallel Computation in the Racetrack Memory
    Riente, Fabrizio
    Turvani, Giovanna
    Vacca, Marco
    Graziano, Mariagrazia
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (02) : 1216 - 1221
  • [2] Memory on the racetrack
    Stuart Parkin
    See-Hun Yang
    Nature Nanotechnology, 2015, 10 : 195 - 198
  • [3] Racetrack memory
    Greene, Kate
    Technology Review, 2009, 112 (02) : 40 - 41
  • [4] Memory on the racetrack
    Parkin, Stuart
    Yang, See-Hun
    NATURE NANOTECHNOLOGY, 2015, 10 (03) : 195 - 198
  • [5] Accelerating Graph Computation with Racetrack Memory and Pointer-Assisted Graph Representation
    Park, Eunhyuk
    Yoo, Sungjoo
    Lee, Sunggu
    Li, Helen
    2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [6] Multibit racetrack memory
    Deger, C.
    NANOTECHNOLOGY, 2020, 31 (49)
  • [7] Dynamic Memory Computation of Impedance Matrix Method
    Kim, Sang Hyun
    JOURNAL OF HYDRAULIC ENGINEERING, 2011, 137 (01) : 122 - 128
  • [8] Magnetic vortex racetrack memory
    Geng, Liwei D.
    Jin, Yongmei M.
    JOURNAL OF MAGNETISM AND MAGNETIC MATERIALS, 2017, 423 : 84 - 89
  • [9] Skyrmion racetrack memory with an antidot
    Kumar Behera, Aroop
    Murapaka, Chandrasekhar
    Mallick, Sougata
    Bhusan Singh, Braj
    Bedanta, Subhankar
    JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2021, 54 (02)
  • [10] In-memory Distributed Matrix Computation Processing and Optimization
    Yu, Yongyang
    Tang, Mingjie
    Aref, Walid G.
    Malluhi, Qutaibah M.
    Abbas, Mostafa M.
    Ouzzani, Mourad
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1047 - 1058