PIE: A Pipeline Energy-efficient Accelerator for Inference Process in Deep Neural Networks

Cited by: 0
Authors
Zhao, Yangyang [1 ]
Yu, Qi [1 ]
Zhou, Xuda [1 ]
Zhou, Xuehai [1 ]
Wang, Chao [1 ]
Li, Xi [1 ]
Affiliations
[1] USTC, Dept Comp Sci & Technol, Hefei, Peoples R China
Funding
US National Science Foundation;
Keywords
accelerator; deep neural networks; FPGA; pipeline; inference;
DOI
10.1109/ICPADS.2016.139
CLC number
TP3 [computing technology, computer technology];
Discipline code
0812;
Abstract
Speeding up the inference process of deep neural networks (DNNs) with hardware accelerators based on field-programmable gate arrays (FPGAs) has become a popular research topic. Because of the layer-wise structure of DNNs and the data dependency between layers, previous studies commonly exploit the inherent parallelism within a single layer to reduce computation time but neglect the parallelism between layers. In this paper, we propose PIE, a pipelined energy-efficient accelerator that speeds up DNN inference by pipelining the computation of every two adjacent layers. By realizing two adjacent layers with different calculation orders, the data dependency between them is weakened: as soon as one layer produces an output, the next layer reads it as an input and starts its own computation immediately. In this way, the computations of adjacent layers are pipelined. We implement PIE on a Zedboard development kit with a Xilinx Zynq-7000 FPGA and compare it against an Intel Core i7 CPU running at 4.0 GHz and an NVIDIA K40C GPU. Experimental results show that PIE is 4.82x as fast as the CPU and reduces energy consumption by 355.35x relative to the CPU and 12.02x relative to the GPU. Moreover, compared with a non-pipelined design in which layers are processed serially, PIE improves performance by nearly 50%.
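The inter-layer pipelining idea described in the abstract can be illustrated in software. The following is a minimal sketch, not the authors' FPGA implementation: all names (pipelined_two_layers, W1, W2) are illustrative, and fully-connected layers with ReLU are assumed. It shows how computing layer 1 one output neuron at a time while layer 2 accumulates partial sums per incoming input removes the wait at the layer boundary: each layer-1 output is folded into layer 2's result the moment it is ready, which is what lets two adjacent layers overlap in a hardware pipeline.

    # Sketch only: illustrates the reordering principle behind inter-layer
    # pipelining; PIE itself is an FPGA design, not this software loop.
    import numpy as np

    def pipelined_two_layers(x, W1, W2):
        """Compute W2 @ relu(W1 @ x) with the two layers interleaved.

        Layer 1 finishes one output neuron per iteration; layer 2
        immediately multiplies it into the matching column of W2 and
        accumulates, instead of waiting for the full layer-1 vector.
        """
        n1 = W1.shape[0]            # number of layer-1 output neurons
        y2 = np.zeros(W2.shape[0])  # layer-2 partial sums, built incrementally
        for j in range(n1):
            a = max(float(W1[j] @ x), 0.0)  # layer 1 emits output j (ReLU)
            y2 += W2[:, j] * a              # layer 2 consumes it at once
        return y2

    # Sanity check against the serial (non-pipelined) computation order.
    rng = np.random.default_rng(0)
    x = rng.standard_normal(16)
    W1 = rng.standard_normal((32, 16))
    W2 = rng.standard_normal((8, 32))
    serial = W2 @ np.maximum(W1 @ x, 0.0)
    assert np.allclose(pipelined_two_layers(x, W1, W2), serial)

In hardware, the two statements in the loop body would run on separate processing units in consecutive pipeline stages; the final assertion only confirms that the reordered computation yields the same result as the serial two-layer evaluation.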
Pages: 1067-1074
Number of pages: 8