HMCSP: Reducing Transaction Latency of CSR-based SPMV in Hybrid Memory Cube

被引:1
|
作者
Qian, Cheng [1 ]
Childers, Bruce [2 ]
Huang, Libo [1 ]
Yu, Qi [1 ]
Wang, Zhiying [1 ]
机构
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Univ Pittsburgh, Pittsburgh, PA 15260 USA
关键词
D O I
10.1109/ISPASS.2018.00021
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Sparse Matrix Multiplication Vector (SPMV) plays a significant role in sparse linear algebra. Based on the high parallelization of matrix multiplication, SPMV has been accelerated with GPUs, Intel MIC, and FPGAs. The Micron Hybrid Memory Cube (HMC) is a highly parallel device that has atomic operations which support processing in memory (PIM). In this paper, we propose HMCSP, which extends the HMC's existing PIM capability to reduce the memory transaction latency of SPMV. By taking advantage of atomic operations and data prefetch, HMCSP reduces memory transaction latency of SPMV by 49.7% compared to a conventional HMC.
引用
收藏
页码:114 / 116
页数:3
相关论文
共 10 条
  • [1] Optimizing CSR-Based SpMV on a New MIMD Architecture Pezy-SC3s
    Guo, Jihu
    Liu, Jie
    Wang, Qinglin
    Zhu, Xiaoxiong
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488 : 22 - 39
  • [2] An Accelerator for Resolution Proof Checking based on FPGA and Hybrid Memory Cube Technology
    Tim Hansmeier
    Marco Platzner
    Md Jubaer Hossain Pantho
    David Andrews
    [J]. Journal of Signal Processing Systems, 2019, 91 : 1259 - 1272
  • [3] An Accelerator for Resolution Proof Checking based on FPGA and Hybrid Memory Cube Technology
    Hansmeier, Tim
    Platzner, Marco
    Pantho, Md Jubaer Hossain
    Andrews, David
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 91 (11-12): : 1259 - 1272
  • [4] Reducing Read Latency in Phase-Change RAM-based Main Memory
    Lee, Sung Kwang
    Yoo, Sungjoo
    Lee, Sunggu
    [J]. 2011 IEEE 54TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2011,
  • [5] MEG: A RISCV-based System Simulation Infrastructure for Exploring Memory Optimization Using FPGAs and Hybrid Memory Cube
    Zhang, Jialiang
    Liu, Yang
    Jain, Gaurav
    Zha, Yue
    Ta, Jonathan
    Li, Jing
    [J]. 2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, : 145 - 153
  • [6] Task Parallelism-Aware Deep Neural Network Scheduling on Multiple Hybrid Memory Cube-Based Processing-in-Memory
    Lee, Young Sik
    Han, Tae Hee
    [J]. IEEE ACCESS, 2021, 9 : 68561 - 68572
  • [7] CGAcc: A Compressed Sparse Row Representation-Based BFS Graph Traversal Accelerator on Hybrid Memory Cube
    Qian, Cheng
    Childers, Bruce
    Huang, Libo
    Guo, Hui
    Wang, Zhiying
    [J]. ELECTRONICS, 2018, 7 (11):
  • [8] DNN-based error level prediction for reducing read latency in 3D NAND flash memory
    He, Jing
    Yu, Xiaolei
    Li, Qianhui
    Zhang, Bo
    Wang, Xianliang
    Zhao, Qianqi
    Qiang, Xuhong
    Wang, Qi
    Huo, Zongliang
    Ye, Tianchun
    [J]. MICROELECTRONICS RELIABILITY, 2023, 147
  • [9] Boosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube: A Case for Breadth First Search
    Zhang, Jialiang
    Khoram, Soroosh
    Li, Jing
    [J]. FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 207 - 216
  • [10] Ge n-Channel Hybrid Memory Based on Ferroelectric Charge Trapping Layer with Low Operating Voltage, Large Memory Window, and Negligible Read Latency
    Chen, Yi-Fan
    Huang, Kai-Yang
    Kuo, Chun-Yi
    Wu, Yung-Hsien
    [J]. IEEE Electron Device Letters, 2024, 45 (10) : 1784 - 1787