Deep neural networks compiler for a trace-based accelerator

被引:2
|
作者
Chang, Andre Xian Ming [1 ]
Zaidy, Aliasger [1 ]
Vitez, Marko [1 ]
Burzawa, Lukasz [1 ]
Culurciello, Eugenio [1 ]
机构
[1] FWDNXT Inc, 1281 Win Hentschel Blvd, W Lafayette, IN 47906 USA
关键词
DNN; Compiler; Accelerator;
D O I
10.1016/j.sysarc.2019.101659
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) are the algorithm of choice for image processing applications. CNNs are a highly parallel workload that leads to the emergence of custom hardware accelerators. Deep Learning (DL) models specialized in different tasks require programmable custom hardware and a compiler/mapper to efficiently translate different CNNs into an efficient dataflow in the accelerator. The goal of this paper is to present a compiler for running CNNs on programmable custom hardware accelerators with a domain-specific ISA that targets CNNs. In this work, the compiler was evaluated and tested on a hardware accelerator that was presented in [18]. The compiler uses model definition files created from popular frameworks to generate custom instructions. The model goes through static compilation and different levels of hardware aware optimizations that improve performance and data reuse of the generated program. The software also exposes an interface to run on various FPGA platforms, providing an end-to-end solution. Various CNN models were benchmarked on different systems while scaling the number of processing units.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Deep Neural Networks Compiler for a Trace-Based Accelerator (Short WIP Paper)
    Chang, Andre Xian Ming
    Zaidy, Aliasger
    Burzawa, Lukasz
    Culurciello, Eugenio
    ACM SIGPLAN NOTICES, 2018, 53 (06) : 89 - 93
  • [2] SPUR: A Trace-Based JIT Compiler for CIL
    Bebenita, Michael
    Brandner, Florian
    Fahndrich, Manuel
    Logozzo, Francesco
    Schulte, Wolfram
    Tillmann, Nikolai
    Venter, Herman
    ACM SIGPLAN NOTICES, 2010, 45 (10) : 708 - 725
  • [3] A trace-based Java JIT compiler retrofitted from a method-based compiler
    IBM Research, Tokyo, Japan
    不详
    Proc. - Int. Symp. Code Gener. Optim., CGO, (246-256):
  • [4] Trace transitioning and exception handling in a Trace-based JIT compiler for Java
    Häubl, Christian
    Wimmer, Christian
    Mössenböck, Hanspeter
    Transactions on Architecture and Code Optimization, 2014, 11 (01):
  • [5] Trace Transitioning and Exception Handling in a Trace-Based JIT Compiler for Java']Java
    Haeubl, Christian
    Wimmer, Christian
    Moessenboeck, Hanspeter
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2014, 11 (01)
  • [6] A Trace-based Java']Java JIT Compiler Retrofitted from a Method-based Compiler
    Inoue, Hiroshi
    Hayashizaki, Hiroshige
    Wu, Peng
    Nakatani, Toshio
    2011 9TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2011, : 246 - 256
  • [7] Training Neural Machines with Trace-Based Supervision
    Mirman, Matthew B.
    Dimitrov, Dimitar
    Djordjevie, Pavle
    Gehr, Timon
    Vechev, Martin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [8] A Hardware Accelerator Based on Quantized Weights for Deep Neural Networks
    Sreehari, R.
    Deepu, Vijayasenan
    Arulalan, M. R.
    EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 1079 - 1091
  • [9] Implementation of FPGA-based Accelerator for Deep Neural Networks
    Tsai, Tsung-Han
    Ho, Yuan-Chen
    Sheu, Ming-Hwa
    2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
  • [10] Adaptive Multi-Level Compilation in a Trace-based Java']Java JIT Compiler
    Inoue, Hiroshi
    Hayashizaki, Hiroshige
    Wu, Peng
    Nakatani, Toshio
    ACM SIGPLAN NOTICES, 2012, 47 (10) : 179 - 193