Exploring Specialized Near-Memory Processing for Data Intensive Operations

被引:0
|
作者
Yitbarek, Salessawi Ferede [1 ]
Yang, Tao [2 ]
Das, Reetuparna [1 ]
Austin, Todd [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Univ Calif San Diego, San Diego, CA 92103 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emerging 3D stacked memory systems provide significantly more bandwidth than current DDR modules. However, general purpose processors do not take full advantage of these resources offered by the memory modules. Taking advantage of the increased bandwidth requires the use of specialized processing units. In this paper, we evaluate the benefits of placing hardware accelerators at the bottom layer of a 3D stacked memory system compared to accelerators that are placed external to the memory stack. Our evaluation of the design using cycle-accurate simulation and RTL synthesis shows that, for important data intensive kernels, near-memory accelerators inside a single 3D memory package provide 3x-13x speedup over a Quad-core Xeon processor. Most of the benefits are from the application of accelerators, as the near-memory configurations provide marginal benefits compared to the same number of accelerators placed on a die external to the memory package. This comparable performance for external accelerators is due to the high bandwidth afforded by the high-speed off-chip links. On the other hand, near-memory accelerators consume 7%-39% less energy than the external accelerators.
引用
收藏
页码:1449 / 1452
页数:4
相关论文
共 50 条
  • [1] Charon: Specialized Near-Memory Processing Architecture for Clearing Dead Objects in Memory
    Jang, Jaeyoung
    Heo, Jun
    Lee, Yejin
    Won, Jaeyeon
    Kim, Seonghak
    Jung, Sung Jun
    Hakbeom, Jang
    Ham, Tae Jun
    Lee, Jae Woo
    MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 726 - 739
  • [2] Near-Memory Data Services
    Falsafi, Babak
    IEEE MICRO, 2016, 36 (01) : 6 - 7
  • [3] TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning
    Kwon, Youngeun
    Lee, Yunjae
    Rhu, Minsoo
    MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 740 - 753
  • [4] Special Issue on Near-Memory and In-Memory Processing
    Pande, Partha Pratim
    IEEE DESIGN & TEST, 2022, 39 (02) : 4 - 4
  • [5] Near-Memory Processing Offload to Remote (Persistent) Memory
    Kisous, Roei
    Golander, Amit
    Korman, Yigal
    Gubner, Tim
    Humborstad, Rune
    Lu, Manyi
    PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023, 2023, : 136 - 136
  • [6] Near-Memory Processing Offload to Remote (Persistent) Memory
    Kisous, Roei
    Golander, Amit
    Korman, Yigal
    Gubner, Tim
    Humborstad, Rune
    Lu, Manyi
    Proceedings of the 16th ACM International Conference on Systems and Storage, SYSTOR 2023, 2023,
  • [7] AOI-Based Data-Centric Circuits for Near-Memory Processing
    Junsangsri, Salin
    Lombardi, Fabrizio
    PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON NANOSCALE ARCHITECTURES (NANOARCH 2017), 2017, : 7 - 12
  • [8] A Survey of Resource Management for Processing-In-Memory and Near-Memory Processing Architectures
    Khan, Kamil
    Pasricha, Sudeep
    Kim, Ryan Gary
    JOURNAL OF LOW POWER ELECTRONICS AND APPLICATIONS, 2020, 10 (04) : 1 - 31
  • [9] FPGA-Based Near-Memory Acceleration of Modern Data-Intensive Applications
    Singh, Gagandeep
    Alser, Mohammed
    Cali, Damla Senol
    Diamantopoulos, Dionysios
    Gomez-Luna, Juan
    Corporaal, Henk
    Mutlu, Onur
    IEEE MICRO, 2021, 41 (04) : 39 - 48
  • [10] Scalable Phylogeny Reconstruction with Disaggregated Near-memory Processing
    Alachiotis, Nikolaos
    Skrimponis, Panagiotis
    Pissadakis, Manolis
    Pnevmatikatos, Dionisios
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (03)