Energy Efficient Boosting of GEMM Accelerators for DNN via Reuse

Cited by: 2

Authors:

Cicek, Nihat Mert [1]

Shen, Xipeng [2]

Ozturk, Ozcan [3]

Affiliations:

[1] Aselsan Corp, Mehmet Akif Ersoy Mahallesi Istiklal Marsi Caddes, TR-06200 Ankara, Turkey

[2] North Carolina State Univ, Dept Comp Sci, Coll Engn, 890 Oval Dr,Engn Bldg 2, Raleigh, NC 27695 USA

[3] Bilkent Univ, Comp Engn Dept, Ankara, Turkey

Source:

ACM Transactions on Design Automation of Electronic Systems | 2022, Vol. 27, No. 5

Keywords:

Reuse; deep neural networks; GEMM; accelerator; approximate nearest neighbor

DOI:

10.1145/3503469

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Reuse-centric convolutional neural network (CNN) acceleration speeds up CNN inference by reusing computations for similar neuron vectors in the CNN's input layer or activation maps. This new optimization paradigm is, however, largely limited by the overhead of neuron vector similarity detection, an important step in reuse-centric CNN. This article presents an in-depth exploration of architectural support for reuse-centric CNN. It addresses some major limitations of the state-of-the-art design and proposes a novel hardware accelerator that improves neuron vector similarity detection and reduces the energy consumption of reuse-centric CNN inference. The accelerator is implemented with a banked memory subsystem to support a wide variety of neural network settings. Design exploration is performed through RTL simulation and synthesis on an FPGA platform. When integrated into Eyeriss, the accelerator can potentially improve performance by up to 7.75x. Furthermore, it can reduce the energy used for similarity detection by up to 95.46%, and it can accelerate the convolutional layer by up to 3.63x compared to the software-based implementation running on the CPU.
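
The reuse idea described in the abstract can be illustrated with a small software sketch. It is not the accelerator's design: the abstract does not say how similarity is detected, so the sketch assumes random-hyperplane locality-sensitive hashing as the detector, and the names `lsh_signature`, `reuse_gemm`, `num_bits`, and `seed` are hypothetical. Rows of `X` stand in for neuron vectors; rows that hash to the same signature share one product with the weight matrix `W`.

```python
import random
from collections import defaultdict


def lsh_signature(vec, hyperplanes):
    """Return one sign bit per hyperplane (assumed similarity detector)."""
    return tuple(
        sum(v * h for v, h in zip(vec, plane)) >= 0.0 for plane in hyperplanes
    )


def reuse_gemm(X, W, num_bits=8, seed=0):
    """Approximate X @ W with one vector-matrix product per cluster.

    X is a list of M rows of length K (neuron vectors); W is a list of K
    rows of length N. Returns (Y, number_of_clusters).
    """
    rng = random.Random(seed)
    k, n = len(W), len(W[0])
    # Hypothetical choice: Gaussian random hyperplanes through the origin.
    hyperplanes = [[rng.gauss(0.0, 1.0) for _ in range(k)] for _ in range(num_bits)]

    # Similarity detection: group row indices by their hash signature.
    clusters = defaultdict(list)
    for i, row in enumerate(X):
        clusters[lsh_signature(row, hyperplanes)].append(i)

    Y = [None] * len(X)
    for members in clusters.values():
        # Compute once on the cluster centroid ...
        centroid = [sum(X[i][j] for i in members) / len(members) for j in range(k)]
        out = [sum(centroid[j] * W[j][c] for j in range(k)) for c in range(n)]
        # ... then reuse that result for every member of the cluster.
        for i in members:
            Y[i] = list(out)
    return Y, len(clusters)


if __name__ == "__main__":
    X = [[1.0, 2.0], [1.0, 2.0], [-1.0, -2.0], [1.0, 2.0]]
    W = [[1.0, 0.0], [0.0, 1.0]]
    Y, num_clusters = reuse_gemm(X, W)
    print(num_clusters, Y)
```

Identical rows always receive the same signature, so they cost a single product. Rows that are merely close share a signature only with some probability, which is why the result is an approximation. More bits in `num_bits` give finer clusters, with less reuse and lower error.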

Pages: 26
