Exploring Specialized Near-Memory Processing for Data Intensive Operations

被引:0
|
作者
Yitbarek, Salessawi Ferede [1 ]
Yang, Tao [2 ]
Das, Reetuparna [1 ]
Austin, Todd [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emerging 3D stacked memory systems provide significantly more bandwidth than current DDR modules. However, general purpose processors do not take full advantage of these resources offered by the memory modules. Taking advantage of the increased bandwidth requires the use of specialized processing units. In this paper, we evaluate the benefits of placing hardware accelerators at the bottom layer of a 3D stacked memory system compared to accelerators that are placed external to the memory stack. Our evaluation of the design using cycle-accurate simulation and RTL synthesis shows that, for important data intensive kernels, near-memory accelerators inside a single 3D memory package provide 3x-13x speedup over a Quad-core Xeon processor. Most of the benefits are from the application of accelerators, as the near-memory configurations provide marginal benefits compared to the same number of accelerators placed on a die external to the memory package. This comparable performance for external accelerators is due to the high bandwidth afforded by the high-speed off-chip links. On the other hand, near-memory accelerators consume 7%-39% less energy than the external accelerators.
引用
收藏
页码:1449 / 1452
页数:4
相关论文
共 50 条
  • [31] A Precision -Optimized Fixed -Point Near-Memory Digital Processing Unit for Analog In -Memory Computing
    Ferro, Elena
    Vasilopoulos, Athanasios
    Lammie, Corey
    Le Gallo, Manuel
    Benini, Luca
    Boybat, Irem
    Sebastian, Abu
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [32] GATe: Streamlining Memory Access and Communication to Accelerate Graph Attention Network With Near-Memory Processing
    Yi, Shiyan
    Qiu, Yudi
    Lu, Lingfei
    Xu, Guohao
    Gong, Yong
    Zeng, Xiaoyang
    Fan, Yibo
    IEEE COMPUTER ARCHITECTURE LETTERS, 2024, 23 (01) : 87 - 90
  • [33] Cache Register Sharing Structure for Channel-level Near-memory Processing in NAND Flash Memory
    Kim, Hyunwoo
    Lee, Hyundong
    Kim, Jongbeom
    Go, Yunjeong
    Baek, Seungwon
    Song, Jaehong
    Kim, Junhyeon
    Jung, Minyoung
    Kim, Hyodong
    Kim, Seongju
    Song, Taigon
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 718 - 723
  • [34] Coherently Attached Programmable Near-Memory Acceleration Platform and its application to Stencil Processing
    van Lunteren, Jan
    Luijten, Ronald
    Diamantopoulos, Dionysios
    Auernhammer, Florian
    Hagleitner, Christoph
    Chelini, Lorenzo
    Corda, Stefano
    Singh, Gagandeep
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 668 - 673
  • [35] INVITED: Enabling Practical Processing in and near Memory for Data-Intensive Computing
    Mutlu, Onur
    Ghose, Saugata
    Gomez-Luna, Juan
    Ausavarungnirun, Rachata
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
  • [36] Near-Memory & In-Memory Detection of Fileless Malware
    Botacin, Marcus
    Gregio, Andre
    Alves, Marco Zanata
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2020, 2020, : 23 - 38
  • [37] Near-memory caching for improved energy consumption
    AbouGhazaleh, Nevine
    Childers, Bruce R.
    Mosse, Daniel
    Melhem, Rami G.
    IEEE TRANSACTIONS ON COMPUTERS, 2007, 56 (11) : 1441 - 1455
  • [38] MetaNMP: Leveraging Cartesian-Like Product to Accelerate HGNNs with Near-Memory Processing
    Chen, Dan
    He, Haiheng
    Jin, Hai
    Zheng, Long
    Huang, Yu
    Shen, Xinyang
    Liao, Xiaofei
    PROCEEDINGS OF THE 2023 THE 50TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, ISCA 2023, 2023, : 784 - 796
  • [39] High-throughput Near-Memory Processing on CNNs with 3D HBM-like Memory
    Park, Naebeom
    Ryu, Sungju
    Kung, Jaeha
    Kim, Jae-Joon
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2021, 26 (06)
  • [40] NMPO: Near-Memory Computing Profiling and Offloading
    Corda, Stefano
    Kumaraswamy, Madhurya
    Awan, Ahsan Javed
    Jordans, Roel
    Kumar, Akash
    Corporaal, Henk
    2021 24TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2021), 2021, : 259 - 267