Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引:0
|
作者
Kung, H. T. [1 ]
McDanel, Bradley [1 ]
Zhang, Sai Qian [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
关键词
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
引用
收藏
页码:330 / 336
页数:7
相关论文
共 50 条
  • [31] Dash: Accelerating Distributed Private Convolutional Neural Network Inference with Arithmetic Garbled Circuits
    Sander, Jonas
    Berndt, Sebastian
    Bruhns, Ida
    Eisenbarth, Thomas
    IACR Transactions on Cryptographic Hardware and Embedded Systems, 2025, 2025 (01): : 420 - 449
  • [32] F3D: Accelerating 3D Convolutional Neural Networks in Frequency Space Using ReRAM
    Liu, Bosheng
    Jiang, Zhuoshen
    Wu, Jigang
    Chen, Xiaoming
    Han, Yinhe
    Liu, Peng
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 571 - 576
  • [33] Octant Convolutional Neural Network for 3D Point Cloud Analysis
    Xu X.
    Shuai H.
    Liu Q.-S.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (12): : 2791 - 2800
  • [34] Classification of pressure ulcer tissues with 3D convolutional neural network
    Begoña García-Zapirain
    Mohammed Elmogy
    Ayman El-Baz
    Adel S. Elmaghraby
    Medical & Biological Engineering & Computing, 2018, 56 : 2245 - 2258
  • [35] MwoA auxiliary diagnosis using 3D convolutional neural network
    Li, Xiang
    Wei, Benzheng
    Wu, Hongyun
    Li, Xuzhou
    Cong, Jinyu
    2020 11TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2020,
  • [36] Classification of pressure ulcer tissues with 3D convolutional neural network
    Garcia-Zapirain, Begona
    Elmogy, Mohammed
    El-Baz, Ayman
    Elmaghraby, Adel S.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2018, 56 (12) : 2245 - 2258
  • [37] LUNG CT SCREENING WITH 3D CONVOLUTIONAL NEURAL NETWORK ARCHITECTURES
    Barbosa Lima, Thiago Jose
    Ushizima, Daniela
    de Carvalho Filho, Antonio Oseas
    Duarte de Araujo, Flavio Henrique
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING WORKSHOPS (IEEE ISBI WORKSHOPS 2020), 2020,
  • [38] A 3D Convolutional Neural Network for Volumetric Image Semantic Segmentation
    Lu, Hongya
    Wang, Haifeng
    Zhang, Qianqian
    Yoon, Sang Won
    Won, Daehan
    25TH INTERNATIONAL CONFERENCE ON PRODUCTION RESEARCH MANUFACTURING INNOVATION: CYBER PHYSICAL MANUFACTURING, 2019, 39 : 422 - 428
  • [39] Learning Human Motion Intention with 3D Convolutional Neural Network
    Owoyemi, Joshua
    Hashimoto, Koichi
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 1810 - 1815
  • [40] 3D Convolutional Neural Network for Predicting Free Energies of Partitioning
    He, Stewart
    Ingolfsson, Helgi
    Sun, Delin
    Bennett, W. F. Drew
    Allen, Jonathan
    Lightstone, Felice C.
    Bilodeau, Camille
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 461A - 462A