Deep neural networks compiler for a trace-based accelerator

被引:2
|
作者
Chang, Andre Xian Ming [1 ]
Zaidy, Aliasger [1 ]
Vitez, Marko [1 ]
Burzawa, Lukasz [1 ]
Culurciello, Eugenio [1 ]
机构
[1] FWDNXT Inc, 1281 Win Hentschel Blvd, W Lafayette, IN 47906 USA
关键词
DNN; Compiler; Accelerator;
D O I
10.1016/j.sysarc.2019.101659
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) are the algorithm of choice for image processing applications. CNNs are a highly parallel workload that leads to the emergence of custom hardware accelerators. Deep Learning (DL) models specialized in different tasks require programmable custom hardware and a compiler/mapper to efficiently translate different CNNs into an efficient dataflow in the accelerator. The goal of this paper is to present a compiler for running CNNs on programmable custom hardware accelerators with a domain-specific ISA that targets CNNs. In this work, the compiler was evaluated and tested on a hardware accelerator that was presented in [18]. The compiler uses model definition files created from popular frameworks to generate custom instructions. The model goes through static compilation and different levels of hardware aware optimizations that improve performance and data reuse of the generated program. The software also exposes an interface to run on various FPGA platforms, providing an end-to-end solution. Various CNN models were benchmarked on different systems while scaling the number of processing units.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Trace-based performance analysis on cell BE
    Biberstein, Marina
    Shvadron, Uzi
    Turek, Javier
    Mendelson, Bilha
    Chang, Moon S.
    ISPASS 2008: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2008, : 213 - +
  • [42] Trace-Based Affine Reconstruction of Codes
    Rodriguez, Gabriel
    Andion, Jose M.
    Kandemir, Mahmut T.
    Tourino, Juan
    PROCEEDINGS OF CGO 2016: THE 14TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2016, : 139 - 149
  • [43] Deep Neural Network Accelerator based on FPGA
    Thang Viet Huynh
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 254 - 257
  • [44] Trace-based mobile network emulation
    Noble, Brian D.
    Satyanarayanan, M.
    Nguyen, Giao T.
    Katz, Randy H.
    Computer Communication Review, 1997, 27 (04): : 51 - 61
  • [45] Functionality-Based Processing-in-Memory Accelerator for Deep Convolutional Neural Networks
    Kim, Min-Jae
    Kim, Jeong-Geun
    Yoon, Su-Kyung
    Kim, Shin-Dug
    IEEE ACCESS, 2021, 9 : 145098 - 145108
  • [46] Flexible Deep-pipelined FPGA-based Accelerator for Spiking Neural Networks
    Lopez-Asuncion, Samuel
    Ituero Herrero, Pablo
    2023 38TH CONFERENCE ON DESIGN OF CIRCUITS AND INTEGRATED SYSTEMS, DCIS, 2023,
  • [47] A RULES COMPILER FOR NEURAL NETWORKS
    PAREY, C
    BONNEMAY, A
    NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 612 - 617
  • [48] Trace-based collaborative learning system
    Lafifi Y.
    Gouasmi N.
    Halimi K.
    Herkas W.
    Salhi N.
    Ghodbani A.
    Journal of Computing and Information Technology, 2010, 18 (03) : 207 - 219
  • [49] Hardware Accelerator for Adversarial Attacks on Deep Learning Neural Networks
    Guo, Haoqiang
    Peng, Lu
    Zhang, Jian
    Qi, Fang
    Duan, Lide
    2019 TENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2019,
  • [50] RECOM: An Efficient Resistive Accelerator for Compressed Deep Neural Networks
    Ji, Houxiang
    Song, Linghao
    Jiang, Li
    Li, Ha
    Chen, Yiran
    PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2018, : 237 - 240