Extensible Embedded Processor for Convolutional Neural Networks

Cited by: 1
Authors
Misko, Joshua [1]
Jadhav, Shrikant S. [2]
Kim, Youngsoo [3]
Affiliations
[1] San Jose State Univ, San Jose, CA 95192 USA
[2] Ft Lewis Coll, Durango, CO 81301 USA
[3] Bradley Univ, Peoria, IL 61625 USA
DOI
10.1155/2021/6630552
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Convolutional neural networks (CNNs) require significant computing power during inference. Smartphones, for example, may not run a facial recognition system or search algorithm smoothly because they lack the resources and supporting hardware. Methods for reducing memory size and increasing execution speed have been explored, but choosing effective techniques for an application requires extensive knowledge of the network architecture. This paper proposes a general approach to preparing a compressed deep neural network processor for inference with minimal additions to existing microprocessor hardware. To show the benefits of the proposed approach, an example CNN for synthetic aperture radar target classification is modified and complementary custom processor instructions are designed. The modified CNN is examined to show the effects of the modifications, and the custom processor instructions are profiled to illustrate the potential performance increase from the new extended instructions.
Pages: 12
Related papers
50 in total
  • [1] Designing A Compact Convolutional Neural Network Processor on Embedded FPGAs
    Ling, Yin-Chun
    Chin, Hsu-Hsun
    Wu, Hsin-I
    Tsay, Ren-Song
    [J]. 2020 IEEE GLOBAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (GCAIOT), 2020, : 71 - 77
  • [2] A DSP engine for an extensible media embedded processor
    Furusawa, T
    Katayama, F
    Arai, Y
    Inoue, S
    Matsui, M
    Nishikawa, M
    Yoshimoto, T
    [J]. PROCEEDINGS OF 2004 IEEE ASIA-PACIFIC CONFERENCE ON ADVANCED SYSTEM INTEGRATED CIRCUITS, 2004, : 160 - 163
  • [3] A self-adaptive extensible embedded processor
    Bauer, Lars
    Shafique, Muhammad
    Teufel, Dirk
    Henkel, Joerg
    [J]. FIRST IEEE INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS, 2007, : 344 - +
  • [4] LAPPED CONVOLUTIONAL NEURAL NETWORKS FOR EMBEDDED SYSTEMS
    Wang, Xing
    Ng, Him Wai
    Liang, Jie
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1135 - 1139
  • [5] On Extensible Networks for Embedded Systems
    Poelzlbauer, Florian
    Bate, Iain
    Brenner, Eugen
    [J]. 2013 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON THE ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS 2013), 2013, : 69 - 77
  • [6] An FPGA-Based Processor for Training Convolutional Neural Networks
    Liu, Zhiqiang
    Dou, Yong
    Jiang, Jingfei
    Wang, Qiang
    Chow, Paul
    [J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 207 - 210
  • [7] Performance Analysis of Convolutional Neural Networks on Embedded Systems
    Grzymkowski, Lukasz
    Stefanski, Tomasz P.
    [J]. PROCEEDINGS OF 2020 27TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEM (MIXDES), 2020, : 266 - 271
  • [8] Embedded facial image processing with Convolutional Neural Networks
    Mamalet, Franck
    Roux, Sebastien
    Garcia, Christophe
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 261 - 264
  • [9] A Ternary Weight Binary Input Convolutional Neural Network: Realization on the Embedded Processor
    Yonekawa, Haruyoshi
    Sato, Shimpei
    Nakahara, Hiroki
    [J]. 2018 IEEE 48TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL 2018), 2018, : 174 - 179
  • [10] OPU: An FPGA-Based Overlay Processor for Convolutional Neural Networks
    Yu, Yunxuan
    Wu, Chen
    Zhao, Tiandong
    Wang, Kun
    He, Lei
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (01) : 35 - 47