Deep neural networks compiler for a trace-based accelerator

被引：2

作者：

Chang, Andre Xian Ming ^{[1
]}

Zaidy, Aliasger ^{[1
]}

Vitez, Marko ^{[1
]}

Burzawa, Lukasz ^{[1
]}

Culurciello, Eugenio ^{[1
]}

机构：

[1] FWDNXT Inc, 1281 Win Hentschel Blvd, W Lafayette, IN 47906 USA

来源：

JOURNAL OF SYSTEMS ARCHITECTURE | 2020年 / 102卷

关键词：

DNN; Compiler; Accelerator;

D O I：

10.1016/j.sysarc.2019.101659

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks (CNNs) are the algorithm of choice for image processing applications. CNNs are a highly parallel workload that leads to the emergence of custom hardware accelerators. Deep Learning (DL) models specialized in different tasks require programmable custom hardware and a compiler/mapper to efficiently translate different CNNs into an efficient dataflow in the accelerator. The goal of this paper is to present a compiler for running CNNs on programmable custom hardware accelerators with a domain-specific ISA that targets CNNs. In this work, the compiler was evaluated and tested on a hardware accelerator that was presented in [18]. The compiler uses model definition files created from popular frameworks to generate custom instructions. The model goes through static compilation and different levels of hardware aware optimizations that improve performance and data reuse of the generated program. The software also exposes an interface to run on various FPGA platforms, providing an end-to-end solution. Various CNN models were benchmarked on different systems while scaling the number of processing units.

引用

页数：9

共 50 条

[41] Trace-based performance analysis on cell BE
Biberstein, Marina
Shvadron, Uzi
Turek, Javier
Mendelson, Bilha
Chang, Moon S.
ISPASS 2008: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2008, : 213 - +
[42] Trace-Based Affine Reconstruction of Codes
Rodriguez, Gabriel
Andion, Jose M.
Kandemir, Mahmut T.
Tourino, Juan
PROCEEDINGS OF CGO 2016: THE 14TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2016, : 139 - 149
[43] Deep Neural Network Accelerator based on FPGA
Thang Viet Huynh
2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 254 - 257
[44] Trace-based mobile network emulation
Noble, Brian D.
Satyanarayanan, M.
Nguyen, Giao T.
Katz, Randy H.
Computer Communication Review, 1997, 27 (04): : 51 - 61
[45] Functionality-Based Processing-in-Memory Accelerator for Deep Convolutional Neural Networks
Kim, Min-Jae
Kim, Jeong-Geun
Yoon, Su-Kyung
Kim, Shin-Dug
IEEE ACCESS, 2021, 9 : 145098 - 145108
[46] Flexible Deep-pipelined FPGA-based Accelerator for Spiking Neural Networks
Lopez-Asuncion, Samuel
Ituero Herrero, Pablo
2023 38TH CONFERENCE ON DESIGN OF CIRCUITS AND INTEGRATED SYSTEMS, DCIS, 2023,
[47] A RULES COMPILER FOR NEURAL NETWORKS
PAREY, C
BONNEMAY, A
NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 612 - 617
[48] Trace-based collaborative learning system
Lafifi Y.
Gouasmi N.
Halimi K.
Herkas W.
Salhi N.
Ghodbani A.
Journal of Computing and Information Technology, 2010, 18 (03) : 207 - 219
[49] Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks
Guo, Haoqiang
Peng, Lu
Zhang, Jian
Qi, Fang
Duan, Lide
2019 TENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2019,
[50] RECOM: An Efficient Resistive Accelerator for Compressed Deep Neural Networks
Ji, Houxiang
Song, Linghao
Jiang, Li
Li, Ha
Chen, Yiran
PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 237 - 240

← 1 2 3 4 5 →