3DL-PIM: A Look-Up Table Oriented Programmable Processing in Memory Architecture Based on the 3-D Stacked Memory for Data-Intensive Applications

Cited by: 2
Authors
Sutradhar, Purab Ranjan [1 ]
Bavikadi, Sathwika [3 ]
Dinakarrao, Sai Manoj Pudukotai [3 ]
Indovina, Mark A. [2 ]
Ganguly, Amlan [1 ]
Affiliations
[1] Rochester Inst Technol, Dept Comp Engn, Rochester, NY 14623 USA
[2] Rochester Inst Technol, Dept Elect & Microelect Engn, Rochester, NY 14623 USA
[3] George Mason Univ, Dept Elect & Comp Engn, Fairfax, VA 22030 USA
Funding
U.S. National Science Foundation;
Keywords
3-D memory; data encryption; deep neural networks; look-up table; parallel processing; processing-in-memory; DRAM;
DOI
10.1109/TETC.2023.3293140
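Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract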
Memory-centric computing systems have demonstrated superior performance and efficiency in memory-intensive applications compared to state-of-the-art CPUs and GPUs. 3-D stacked DRAM architectures unlock higher I/O data bandwidth than the traditional 2-D memory architecture and are therefore better suited for incorporating memory-centric processors. However, merely integrating high-precision ALUs in the 3-D stacked memory does not ensure an optimized design, since such a design can achieve only limited utilization of the internal bandwidth of a memory chip and limited operational parallelization. To address this, we propose 3DL-PIM, a 3-D stacked memory-based Processing in Memory (PIM) architecture that places multiple Look-up Table (LUT)-based low-footprint Processing Elements (PEs) within the memory banks in order to achieve high parallel computing performance by maximizing data-bandwidth utilization. Instead of relying on traditional logic-based ALUs, the PEs are formed by clustering a group of programmable LUTs and can therefore be programmed on-the-fly to perform various logic/arithmetic operations. Our simulations show that 3DL-PIM can achieve up to 2.6x higher processing performance at 2.65x higher area efficiency compared to a state-of-the-art 3-D stacked memory-based accelerator.
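The idea of a programmable LUT standing in for a fixed-function ALU can be illustrated with a minimal Python sketch. It is not taken from the paper: the 4-bit operand width, the table layout, and the names `build_lut`, `lut_apply`, and `mul8_from_4bit_luts` are all assumptions chosen for illustration. The sketch shows one table being "programmed" with different operations, and an 8-bit multiply composed from four 4-bit table look-ups.

```python
from typing import Callable, List


def build_lut(op: Callable[[int, int], int], bits: int = 4) -> List[int]:
    """Precompute op(a, b) for every pair of `bits`-wide operands.

    Hypothetical layout: entry index is (a << bits) | b.
    """
    size = 1 << bits
    return [op(a, b) for a in range(size) for b in range(size)]


def lut_apply(lut: List[int], a: int, b: int, bits: int = 4) -> int:
    """Evaluate the programmed operation with a single table read."""
    return lut[(a << bits) | b]


def mul8_from_4bit_luts(lut_mul: List[int], x: int, y: int) -> int:
    """Compose an 8-bit x 8-bit multiply from four 4-bit LUT look-ups.

    Uses x*y = xl*yl + ((xl*yh + xh*yl) << 4) + ((xh*yh) << 8),
    where xl/xh and yl/yh are the low/high nibbles.
    """
    xl, xh = x & 0xF, x >> 4
    yl, yh = y & 0xF, y >> 4
    ll = lut_apply(lut_mul, xl, yl)
    lh = lut_apply(lut_mul, xl, yh)
    hl = lut_apply(lut_mul, xh, yl)
    hh = lut_apply(lut_mul, xh, yh)
    return ll + ((lh + hl) << 4) + (hh << 8)


# "Reprogramming" here just means loading a different table.
LUT_MUL = build_lut(lambda a, b: a * b)
LUT_XOR = build_lut(lambda a, b: a ^ b)

if __name__ == "__main__":
    print(lut_apply(LUT_XOR, 0b1010, 0b0110))
    print(mul8_from_4bit_luts(LUT_MUL, 200, 123))
```

In this sketch the table contents alone define the operation, so switching from multiplication to XOR needs no change to the look-up path; how the actual PEs cluster and reprogram their LUTs is described in the paper itself.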
Pages: 60-72
Page count: 13
Related papers
13 in total
  • [1] An Ultra-efficient Look-up Table based Programmable Processing in Memory Architecture for Data Encryption
    Sutradhar, Purab Ranjan
    Basu, Kanad
    Dinakarrao, Sai Manoj Pudukotai
    Ganguly, Amlan
    [J]. 2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 252 - 259
  • [2] Flexible Instruction Set Architecture for Programmable Look-up Table based Processing-in-Memory
    Connolly, Mark
    Sutradhar, Purab Ranjan
    Indovina, Mark
    Ganguly, Amlan
    [J]. 2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 66 - 73
  • [3] Exploiting GPU with 3D Stacked Memory to Boost Performance for Data-Intensive Applications
    Wen, Hao
    Zhang, Wei
    [J]. 2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [4] 3D Flash Memory for Data-intensive Applications
    Inaba, Satoshi
    [J]. 2018 IEEE 10TH INTERNATIONAL MEMORY WORKSHOP (IMW), 2018, : 1 - 4
  • [5] Look-up-Table Based Processing-in-Memory Architecture With Programmable Precision-Scaling for Deep Learning Applications
    Sutradhar, Purab Ranjan
    Bavikadi, Sathwika
    Connolly, Mark
    Prajapati, Savankumar
    Indovina, Mark A.
    Dinakarrao, Sai Manoj Pudukotai
    Ganguly, Amlan
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (02) : 263 - 275
  • [6] FlutPIM: A Look-up Table-based Processing in Memory Architecture with Floating-point Computation Support for Deep Learning Applications
    Sutradhar, Purab Ranjan
    Bavikadi, Sathwika
    Indovina, Mark
    Dinakarrao, Sai Manoj Pudukotai
    Ganguly, Amlan
    [J]. PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2023, GLSVLSI 2023, 2023, : 207 - 211
  • [7] LUTSim: A Look-Up Table-Based Thermal Simulator for 3-D ICs
    Lee, Yu-Min
    Pan, Chi-Wen
    Huang, Pei-Yu
    Yang, Chi-Ping
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (08) : 1250 - 1263
  • [8] CiM3D: Comparator-in-Memory Designs Using Monolithic 3-D Technology for Accelerating Data-Intensive Applications
    Ramanathan, Akshay Krishna
    Rangachar, Srivatsa Srinivasa
    Govindarajan, Hariram Thirucherai
    Hung, Je-Min
    Lee, Chun-Ying
    Xue, Cheng-Xin
    Huang, Sheng-Po
    Hsueh, Fu-Kuo
    Shen, Chang-Hong
    Shieh, Jia-Min
    Yeh, Wen-Kuan
    Ho, Mon-Shu
    Sampson, Jack
    Chang, Meng-Fan
    Narayanan, Vijaykrishnan
    [J]. IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2021, 7 (01): : 79 - 87
  • [9] Novel Hybrid Computing Architecture with Memristor-Based Processing-in-Memory for Data-Intensive Applications
    Zhang, Xunming
    Zhang, Quan
    Yang, Jianguo
    Wangchen, Zedai
    Jing, Ming'e
    Wang, Mingyu
    Zeng, Xiaoyang
    Xue, Xiaoyong
    [J]. 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2018, : 1190 - 1192
  • [10] I-LUTSim: An Iterative Look-Up Table Based Thermal Simulator for 3-D ICs
    Pan, Chi-Wen
    Lee, Yu-Min
    Huang, Pei-Yu
    Yang, Chi-Ping
    Lin, Chang-Tzu
    Lee, Chia-Hsin
    Chou, Yung-Fa
    Kwai, Ding-Ming
    [J]. 2013 18TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2013, : 151 - 156