Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引:0
|
作者
Kung, H. T. [1 ]
McDanel, Bradley [1 ]
Zhang, Sai Qian [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
关键词
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
引用
收藏
页码:330 / 336
页数:7
相关论文
共 50 条
  • [21] Human Action Recognition with 3D Convolutional Neural Network
    Lima, Tiago
    Fernandes, Bruno
    Barros, Pablo
    2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
  • [22] RECOGNIZING CHINESE TEXTS WITH 3D CONVOLUTIONAL NEURAL NETWORK
    Chen, Kuan-Chou
    Lin, Guan-Ting
    Lin, Che-Tsung
    Guo, Jiun-In
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2120 - 2123
  • [23] A lightweight 3D convolutional neural network for deepfake detection
    Liu, Jiarui
    Zhu, Kaiman
    Lu, Wei
    Luo, Xiangyang
    Zhao, Xianfeng
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (09) : 4990 - 5004
  • [24] Fast 3D lithography simulation by convolutional neural network
    Tanabe, Hiroyoshi
    Sato, Shimpei
    Takahashi, Atsushi
    DESIGN-PROCESS-TECHNOLOGY CO-OPTIMIZATION XV, 2021, 11614
  • [25] 3D Face Reconstruction Based on Convolutional Neural Network
    Li Fangmin
    Chen Ke
    Liu Xinhua
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 71 - 74
  • [26] 3D convolutional neural network for object recognition: a review
    Singh, Rahul Dev
    Mittal, Ajay
    Bhatia, Rajesh K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 15951 - 15995
  • [27] Fast FFT-Based Inference in 3D Convolutional Neural Networks
    Xie, Bo
    Zhang, Guidong
    Shen, Yongjun
    Liu, Shun
    Ge, Yabin
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 420 - 431
  • [28] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
    Mohammad Reza Falahzadeh
    Fardad Farokhi
    Ali Harimi
    Reza Sabbaghi-Nadooshan
    Circuits, Systems, and Signal Processing, 2023, 42 : 4271 - 4291
  • [29] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
    Falahzadeh, Mohammad Reza
    Farokhi, Fardad
    Harimi, Ali
    Sabbaghi-Nadooshan, Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (07) : 4271 - 4291
  • [30] 3D Sketch-based 3D Model Retrieval with Convolutional Neural Network
    Ye, Yuxiang
    Li, Bo
    Lu, Yijuan
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2936 - 2941