Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引：0

作者：

Kung, H. T. ^{[1
]}

McDanel, Bradley ^{[1
]}

Zhang, Sai Qian ^{[1
]}

机构：

[1] Harvard Univ, Cambridge, MA 02138 USA

来源：

PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS) | 2018年

关键词：

systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.

引用

页码：330 / 336

页数：7

共 50 条

[21] Human Action Recognition with 3D Convolutional Neural Network
Lima, Tiago
Fernandes, Bruno
Barros, Pablo
2017 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2017,
[22] RECOGNIZING CHINESE TEXTS WITH 3D CONVOLUTIONAL NEURAL NETWORK
Chen, Kuan-Chou
Lin, Guan-Ting
Lin, Che-Tsung
Guo, Jiun-In
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2120 - 2123
[23] A lightweight 3D convolutional neural network for deepfake detection
Liu, Jiarui
Zhu, Kaiman
Lu, Wei
Luo, Xiangyang
Zhao, Xianfeng
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (09) : 4990 - 5004
[24] Fast 3D lithography simulation by convolutional neural network
Tanabe, Hiroyoshi
Sato, Shimpei
Takahashi, Atsushi
DESIGN-PROCESS-TECHNOLOGY CO-OPTIMIZATION XV, 2021, 11614
[25] 3D Face Reconstruction Based on Convolutional Neural Network
Li Fangmin
Chen Ke
Liu Xinhua
2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 71 - 74
[26] 3D convolutional neural network for object recognition: a review
Singh, Rahul Dev
Mittal, Ajay
Bhatia, Rajesh K.
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (12) : 15951 - 15995
[27] Fast FFT-Based Inference in 3D Convolutional Neural Networks
Xie, Bo
Zhang, Guidong
Shen, Yongjun
Liu, Shun
Ge, Yabin
INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 420 - 431
[28] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
Mohammad Reza Falahzadeh
Fardad Farokhi
Ali Harimi
Reza Sabbaghi-Nadooshan
Circuits, Systems, and Signal Processing, 2023, 42 : 4271 - 4291
[29] A 3D Tensor Representation of Speech and 3D Convolutional Neural Network for Emotion Recognition
Falahzadeh, Mohammad Reza
Farokhi, Fardad
Harimi, Ali
Sabbaghi-Nadooshan, Reza
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (07) : 4271 - 4291
[30] 3D Sketch-based 3D Model Retrieval with Convolutional Neural Network
Ye, Yuxiang
Li, Bo
Lu, Yijuan
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2936 - 2941

← 1 2 3 4 5 →