Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引:0
|
作者
Kung, H. T. [1 ]
McDanel, Bradley [1 ]
Zhang, Sai Qian [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
关键词
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
引用
收藏
页码:330 / 336
页数:7
相关论文
共 50 条
  • [1] Mapping of neural network models onto systolic arrays
    Mahapatra, S
    Mahapatra, RN
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2000, 60 (06) : 677 - 689
  • [2] Accelerating Convolutional Neural Network Inference Based on a Reconfigurable Sliced Systolic Array
    Zeng, Yixuan
    Sun, Heming
    Katto, Jiro
    Fan, Yibo
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [3] A 3D Implementation of Convolutional Neural Network for Fast Inference
    Miniskar, Narasinga Rao
    Vanna-iampikul, Pruek
    Young, Aaron
    Lim, Sung Kyu
    Liu, Frank
    Yoo, Jieun
    Mills, Corrinne
    Nhan Tran
    Fahim, Farah
    Vetter, Jeffrey S.
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [4] Gene Regulatory Network Inference using 3D Convolutional Neural Network
    Fan, Yue
    Ma, Xiuli
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 99 - 106
  • [5] Accelerating Sparse Convolutional Neural Networks with Systolic Arrays on FPGA
    Nehete, Hemkant
    Verma, Gaurav
    Yadav, Shailendra
    Kaushik, Brajesh Kumar
    APPLICATIONS OF MACHINE LEARNING 2023, 2023, 12675
  • [6] Mapping 3-d IIR digital filter onto systolic arrays
    ElGuibaly, F
    Tawfik, A
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 1996, 7 (01) : 7 - 26
  • [7] Convolutional Bayesian Kernel Inference for 3D Semantic Mapping
    Wilson, Joey
    Fu, Yuewei
    Zhang, Arthur
    Song, Jingyu
    Capodieci, Andrew
    Jayakumar, Paramsothy
    Barton, Kira
    Ghaffari, Maani
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 8364 - 8370
  • [8] Detection and classification of faults in photovoltaic arrays using a 3D convolutional neural network
    Hong, Ying-Yi
    Pula, Rolando A.
    ENERGY, 2022, 246
  • [9] Accelerating 3D Convolutional Neural Networks Using 3D Fast Fourier Transform
    Fang, Chao
    He, Liulu
    Wang, Haonan
    Wei, Jinghe
    Wang, Zhongfeng
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [10] Kernel Mapping Methods of Convolutional Neural Network in 3D NAND Flash Architecture
    Song, Min Suk
    Hwang, Hwiho
    Lee, Geun Ho
    Ahn, Suhyeon
    Hwang, Sungmin
    Kim, Hyungjin
    ELECTRONICS, 2023, 12 (23)