Active Memory Cube: A processing-in-memory architecture for exascale systems

被引:128
|
作者
Nair, R. [1 ]
Antao, S. F. [1 ]
Bertolli, C. [1 ]
Bose, P. [1 ]
Brunheroto, J. R. [1 ]
Chen, T. [1 ]
Cher, C. -Y. [1 ]
Costa, C. H. A. [1 ]
Doi, J. [2 ]
Evangelinos, C. [3 ]
Fleischer, B. M. [1 ]
Fox, T. W. [1 ]
Gallo, D. S. [4 ]
Grinberg, L. [5 ]
Gunnels, J. A. [1 ]
Jacob, A. C. [1 ]
Jacob, P. [1 ]
Jacobson, H. M. [1 ]
Karkhanis, T. [1 ]
Kim, C. [1 ]
Moreno, J. H. [1 ]
O'Brien, J. K. [1 ]
Ohmacht, M. [1 ]
Park, Y. [1 ]
Prener, D. A. [1 ]
Rosenburg, B. S. [1 ]
Ryu, K. D. [6 ]
Sallenave, O. [1 ]
Serrano, M. J. [1 ]
Siegl, P. D. M. [7 ]
Sugavanam, K. [1 ]
Sura, Z. [1 ]
机构
[1] IBM Res Div, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] IBM Res Div, Tokyo, Japan
[3] IBM Res Div, Cambridge, MA 02142 USA
[4] IBM Res Brazil, BR-04007900 Sao Paulo, Brazil
[5] IBM Res Div, Thomas J Watson Res Ctr, Cambridge, MA 02142 USA
[6] LG Elect, Software Platform Lab, Seoul, South Korea
[7] Tech Univ Carolo Wilhelmina Braunschweig, Chair Chip Design Embedded Comp C3E, D-38106 Braunschweig, Germany
关键词
Compendex;
D O I
10.1147/JRD.2015.2409732
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many studies point to the difficulty of scaling existing computer architectures to meet the needs of an exascale system (i.e., capable of executing 1018 floating-point operations per second), consuming no more than 20 MW in power, by around the year 2020. This paper outlines a new architecture, the Active Memory Cube, which reduces the energy of computation significantly by performing computation in the memory module, rather than moving data through large memory hierarchies to the processor core. The architecture leverages a commercially demonstrated 3D memory stack called the Hybrid Memory Cube, placing sophisticated computational elements on the logic layer below its stack of dynamic random-access memory (DRAM) dies. The paper also describes an Active Memory Cube tuned to the requirements of a scientific exascale system. The computational elements have a vector architecture and are capable of performing a comprehensive set of floating-point and integer instructions, predicated operations, and gather-scatter accesses across memory in the Cube. The paper outlines the software infrastructure used to develop applications and to evaluate the architecture, and describes results of experiments on application kernels, along with performance and power projections.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Optimization of OLAP In-Memory Database Management Systems with Processing-In-Memory Architecture
    Hosseinzadeh, Shima
    Parvaresh, Amirhossein
    Fey, Dietmar
    ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2023, 2023, 13949 : 264 - 278
  • [2] PIMCH: Cooperative Memory Prefetching in Processing-In-Memory Architecture
    Xui, Sheng
    Wang, Ying
    Han, Yinhe
    Li, Xiaowei
    2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 209 - 214
  • [3] Heterogeneous Memory Architecture Accommodating Processing-In-Memory on SoC For AIoT Applications
    Qiu, Kangyi
    Zhang, Yaojun
    Yan, Bonan
    Huang, Ru
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 383 - 388
  • [4] Towards Memory-Efficient Allocation of CNNs on Processing-in-Memory Architecture
    Wang, Yi
    Chen, Weixuan
    Yang, Jing
    Li, Tao
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (06) : 1428 - 1441
  • [5] Reconfigurable Processing-in-Memory Architecture for Data Intensive Applications
    Bavikadi, Sathwika
    Sutradhar, Purab Ranjan
    Ganguly, Amlan
    Dinakarrao, Sai Manoj Pudukotai
    PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, VLSID 2024 AND 23RD INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, ES 2024, 2024, : 222 - 227
  • [6] Study on Processing-in-Memory Technology based on Dataflow Architecture
    Choi, Kyu Hyun
    Hwang, Taeho
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [7] Exploiting Parallelism for Convolutional Connections in Processing-In-Memory Architecture
    Wang, Yi
    Zhang, Mingxu
    Yang, Jing
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [8] A Customized Processing-in-Memory Architecture for Biological Sequence Alignment
    Akbari, Nasrin
    Modarressi, Mehdi
    Daneshtalab, Masoud
    Loni, Mohammad
    2018 IEEE 29TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2018, : 158 - 165
  • [9] Towards Memory-Efficient Processing-in-Memory Architecture for Convolutional Neural Networks
    Wang, Yi
    Zhang, Mingxu
    Yang, Jing
    ACM SIGPLAN NOTICES, 2017, 52 (05) : 81 - 90
  • [10] Implementation of a Low-Overhead Processing-in-Memory Architecture
    Jang, Young-Jong
    Kim, Byung-Soo
    Kim, Dong-Sun
    Hwang, Tae-ho
    2016 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2016, : 185 - 186