Active Memory Cube: A processing-in-memory architecture for exascale systems

被引：128

作者：

Nair, R. ^{[1
]}

Antao, S. F. ^{[1
]}

Bertolli, C. ^{[1
]}

Bose, P. ^{[1
]}

Brunheroto, J. R. ^{[1
]}

Chen, T. ^{[1
]}

Cher, C. -Y. ^{[1
]}

Costa, C. H. A. ^{[1
]}

Doi, J. ^{[2
]}

Evangelinos, C. ^{[3
]}

Fleischer, B. M. ^{[1
]}

Fox, T. W. ^{[1
]}

Gallo, D. S. ^{[4
]}

Grinberg, L. ^{[5
]}

Gunnels, J. A. ^{[1
]}

Jacob, A. C. ^{[1
]}

Jacob, P. ^{[1
]}

Jacobson, H. M. ^{[1
]}

Karkhanis, T. ^{[1
]}

Kim, C. ^{[1
]}

Moreno, J. H. ^{[1
]}

O'Brien, J. K. ^{[1
]}

Ohmacht, M. ^{[1
]}

Park, Y. ^{[1
]}

Prener, D. A. ^{[1
]}

Rosenburg, B. S. ^{[1
]}

Ryu, K. D. ^{[6
]}

Sallenave, O. ^{[1
]}

Serrano, M. J. ^{[1
]}

Siegl, P. D. M. ^{[7
]}

Sugavanam, K. ^{[1
]}

Sura, Z. ^{[1
]}

机构：

[1] IBM Res Div, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA

[2] IBM Res Div, Tokyo, Japan

[3] IBM Res Div, Cambridge, MA 02142 USA

[4] IBM Res Brazil, BR-04007900 Sao Paulo, Brazil

[5] IBM Res Div, Thomas J Watson Res Ctr, Cambridge, MA 02142 USA

[6] LG Elect, Software Platform Lab, Seoul, South Korea

[7] Tech Univ Carolo Wilhelmina Braunschweig, Chair Chip Design Embedded Comp C3E, D-38106 Braunschweig, Germany

来源：

IBM JOURNAL OF RESEARCH AND DEVELOPMENT | 2015年 / 59卷 / 2-3期

关键词：

Compendex;

D O I：

10.1147/JRD.2015.2409732

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Many studies point to the difficulty of scaling existing computer architectures to meet the needs of an exascale system (i.e., capable of executing 1018 floating-point operations per second), consuming no more than 20 MW in power, by around the year 2020. This paper outlines a new architecture, the Active Memory Cube, which reduces the energy of computation significantly by performing computation in the memory module, rather than moving data through large memory hierarchies to the processor core. The architecture leverages a commercially demonstrated 3D memory stack called the Hybrid Memory Cube, placing sophisticated computational elements on the logic layer below its stack of dynamic random-access memory (DRAM) dies. The paper also describes an Active Memory Cube tuned to the requirements of a scientific exascale system. The computational elements have a vector architecture and are capable of performing a comprehensive set of floating-point and integer instructions, predicated operations, and gather-scatter accesses across memory in the Cube. The paper outlines the software infrastructure used to develop applications and to evaluate the architecture, and describes results of experiments on application kernels, along with performance and power projections.

引用

页数：14

共 50 条

[1] Optimization of OLAP In-Memory Database Management Systems with Processing-In-Memory Architecture
Hosseinzadeh, Shima
Parvaresh, Amirhossein
Fey, Dietmar
ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2023, 2023, 13949 : 264 - 278
[2] PIMCH: Cooperative Memory Prefetching in Processing-In-Memory Architecture
Xui, Sheng
Wang, Ying
Han, Yinhe
Li, Xiaowei
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 209 - 214
[3] Heterogeneous Memory Architecture Accommodating Processing-In-Memory on SoC For AIoT Applications
Qiu, Kangyi
Zhang, Yaojun
Yan, Bonan
Huang, Ru
27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 383 - 388
[4] Towards Memory-Efficient Allocation of CNNs on Processing-in-Memory Architecture
Wang, Yi
Chen, Weixuan
Yang, Jing
Li, Tao
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (06) : 1428 - 1441
[5] Reconfigurable Processing-in-Memory Architecture for Data Intensive Applications
Bavikadi, Sathwika
Sutradhar, Purab Ranjan
Ganguly, Amlan
Dinakarrao, Sai Manoj Pudukotai
PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, VLSID 2024 AND 23RD INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, ES 2024, 2024, : 222 - 227
[6] Study on Processing-in-Memory Technology based on Dataflow Architecture
Choi, Kyu Hyun
Hwang, Taeho
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[7] Exploiting Parallelism for Convolutional Connections in Processing-In-Memory Architecture
Wang, Yi
Zhang, Mingxu
Yang, Jing
PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
[8] A Customized Processing-in-Memory Architecture for Biological Sequence Alignment
Akbari, Nasrin
Modarressi, Mehdi
Daneshtalab, Masoud
Loni, Mohammad
2018 IEEE 29TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2018, : 158 - 165
[9] Towards Memory-Efficient Processing-in-Memory Architecture for Convolutional Neural Networks
Wang, Yi
Zhang, Mingxu
Yang, Jing
ACM SIGPLAN NOTICES, 2017, 52 (05) : 81 - 90
[10] Implementation of a Low-Overhead Processing-in-Memory Architecture
Jang, Young-Jong
Kim, Byung-Soo
Kim, Dong-Sun
Hwang, Tae-ho
2016 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2016, : 185 - 186

← 1 2 3 4 5 →