On Continuing DNN Accelerator Architecture Scaling Using Tightly Coupled Compute-on-Memory 3-D ICs

被引:2
|
作者
Murali, Gauthaman [1 ]
Iyer, Aditya [1 ]
Zhu, Lingjun [1 ]
Tong, Jianming [1 ]
Martinez, Francisco Munoz [2 ]
Srinivasa, Srivatsa Rangachar [3 ]
Karnik, Tanay [3 ]
Krishna, Tushar [1 ]
Lim, Sung Kyu [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30308 USA
[2] Univ Murcia, Fac Inform, Dept Ingn & Tecnol Comp, Campus Espinardo, Murcia 30100, Spain
[3] Intel Corp, Hillsboro, OR 97124 USA
关键词
3-D accelerator physical design; 3-D bond pitch study for accelerators; high-performance 3-D accelerator; multitier 3-D ML accelerator; SPACE EXPLORATION;
D O I
10.1109/TVLSI.2023.3299564
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This work identifies the architectural and design scaling limits of 2-D flexible interconnect deep neural network (DNN) accelerators and addresses them with 3-D ICs. We demonstrate how scaling up a baseline 2-D accelerator in the X/ Y dimension fails and how vertical stacking effectively overcomes the failure. We designed multitier accelerators that are 1.67x faster than the 2-D design. Using our 3-D architecture and circuit codesign methodology, we improve throughput, energy efficiency, and area efficiency by up to 5x, 1.2x, and 3.9x, respectively, over 2-D counterparts. The IR-drop in our 3-D designs is within 10.7% of VDD, and the temperature variation is within 12 degrees C.
引用
收藏
页码:1603 / 1613
页数:11
相关论文
共 12 条
  • [1] A TSV Repair Scheme Using Enhanced Test Access Architecture for 3-D ICs
    Yang, Chi-Chun
    Chou, Che-Wei
    Li, Jin-Fu
    2013 22ND ASIAN TEST SYMPOSIUM (ATS), 2013, : 7 - 12
  • [2] Exploring the Feasibility of Using 3-D XPoint as an In-Memory Computing Accelerator
    Zabihi, Masoud
    Resch, Salonik
    Cilasun, Husrev
    Chowdhury, Zamshed, I
    Zhao, Zhengyang
    Karpuzcu, Ulya R.
    Wang, Jian-Ping
    Sapatnekar, Sachin S.
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2021, 7 (02): : 88 - 96
  • [3] Recent Advances in Compute-in-Memory Support for SRAM Using Monolithic 3-D Integration
    Zhang, Zhixiao
    Si, Xin
    Srinivasa, Srivatsa
    Ramanathan, Akshay Krishna
    Chang, Meng-Fan
    IEEE MICRO, 2019, 39 (06) : 28 - 37
  • [4] A Heterogeneous and Programmable Compute-In-Memory Accelerator Architecture for Analog-AI Using Dense 2-D Mesh
    Jain, Shubham
    Tsai, Hsinyu
    Chen, Ching-Tzu
    Muralidhar, Ramachandran
    Boybat, Irem
    Frank, Martin M.
    Wozniak, Stanislaw
    Stanisavljevic, Milos
    Adusumilli, Praneet
    Narayanan, Pritish
    Hosokawa, Kohji
    Ishii, Masatoshi
    Kumar, Arvind
    Narayanan, Vijay
    Burr, Geoffrey W.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (01) : 114 - 127
  • [5] Power-Aware Neuromorphic Architecture With Partial Voltage Scaling 3-D Stacking Synaptic Memory
    Nguyen, Ngo-Doanh
    Ben Ahmed, Akram
    Ben Abdallah, Abderazek
    Dang, Khanh N.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (12) : 2016 - 2029
  • [6] Energy-/Carbon-Aware Evaluation and Optimization of 3-D IC Architecture With Digital Compute-in-Memory Designs
    Byun, Hyung Joon
    Gupta, Udit
    Seo, Jae-Sun
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2024, 10 : 98 - 106
  • [7] Memory-Efficient Architecture for 3-D DWT Using Overlapped Grouping of Frames
    Mohanty, Basant K.
    Meher, Pramod K.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (11) : 5605 - 5616
  • [8] A Scalable and Reconfigurable Bit-Serial Compute-Near-Memory Hardware Accelerator for Solving 2-D/3-D Partial Differential Equations
    Mu, Junjie
    Yu, Chengshuo
    Kim, Tony Tae-Hyoung
    Kim, Bongjin
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 59 (08) : 2706 - 2716
  • [9] H3DAtten: Heterogeneous 3-D Integrated Hybrid Analog and Digital Compute-in-Memory Accelerator for Vision Transformer Self-Attention
    Li, Wantong
    Manley, Madison
    Read, James
    Kaul, Ankit
    Bakir, Muhannad S.
    Yu, Shimeng
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (10) : 1592 - 1602
  • [10] Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads
    Luo, Yuan-Chun
    Lu, Anni
    Sharda, Janak
    Scherer, Moritz
    Gomez, Jorge Tomas
    Sarwar, Syed Shakib
    Li, Ziyun
    Pinkham, Reid Frederick
    Salvo, Barbara De
    Yu, Shimeng
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (09) : 1718 - 1725