A streaming accelerator of Convolutional Neural Networks for resource-limited applications

被引:6
|
作者
Arredondo-Velazquez, Moises [1 ]
Diaz-Carmona, Javier [1 ]
Torres-Huitzil, Cesar [2 ]
Barranco-Gutierrez, Alejandro-Israel [1 ]
Padilla-Medina, Alfredo [1 ]
Prado-Olivarez, Juan [1 ]
机构
[1] Technol Inst Celaya, Elect Engn Dept, Av Tecnol & G Cubas S-N, Celaya 38010, Gto, Mexico
[2] Tecnol Monterrey, Sch Engn & Sci, Campus Puebla,Av Atlixcayotl 5718, Puebla 72453, Mexico
来源
IEICE ELECTRONICS EXPRESS | 2019年 / 16卷 / 23期
关键词
Convolutional Neural Networks; streaming architecture; Layer Operation Chaining;
D O I
10.1587/elex.16.20190633
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neuronal Networks (CNN) implementation on embedded devices is restricted due to the number of layers of some CNN models. In this context, this paper describes a novel architecture based on Layer Operation Chaining (LOC) which uses fewer convolvers than convolution layers. A reutilization of hardware convolvers is promoted through kernel decomposition. Thus, an architectural design with reduced resources utilization is achieved, suitable to be implemented on low-end devices as a solution for portable classification applications. Experimental results show that the proposed design has a competitive processing time and overcomes resource utilization when compared with state-of-the-art related works.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Resource-Limited Hardware Accelerator for Convolutional Neural Networks in Embedded Vision Applications
    Moini, Shayan
    Alizadeh, Bijan
    Emad, Mohammad
    Ebrahimpour, Reza
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2017, 64 (10) : 1217 - 1221
  • [2] Embedded Streaming Deep Neural Networks Accelerator With Applications
    Dundar, Aysegul
    Jin, Jonghoon
    Martini, Berin
    Culurciello, Eugenio
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) : 1572 - 1583
  • [3] Scalable FPGA Accelerator for Deep Convolutional Neural Networks with Stochastic Streaming
    Alawad, Mohammed
    Lin, Mingjie
    [J]. IEEE TRANSACTIONS ON MULTI-SCALE COMPUTING SYSTEMS, 2018, 4 (04): : 888 - 899
  • [4] An Efficient Streaming Accelerator for Low Bit-Width Convolutional Neural Networks
    Chen, Qinyu
    Fu, Yuxiang
    Song, Wenqing
    Cheng, Kaifeng
    Lu, Zhonghai
    Zhang, Chuan
    Li, Li
    [J]. ELECTRONICS, 2019, 8 (04)
  • [5] A Resource-Efficient Inference Accelerator for Binary Convolutional Neural Networks
    Kim, Tae-Hwan
    Shin, Jihoon
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (01) : 451 - 455
  • [6] Efficient Binarized Convolutional Layers for Visual Inspection Applications on Resource-Limited FPGAs and ASICs
    Simons, Taylor
    Lee, Dah-Jye
    [J]. ELECTRONICS, 2021, 10 (13)
  • [7] Accelerator Design with Effective Resource Utilization for Binary Convolutional Neural Networks on an FPGA
    Kim, Sunwoong
    Rutenbar, Rob A.
    [J]. PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 218 - 218
  • [8] Accelerator Design for Convolutional Neural Network with Vertical Data Streaming
    Li, Shanliao
    Ning, Ouyang
    Wang, Zheng
    [J]. 2018 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2018), 2018, : 544 - 547
  • [9] A Sequential Approach to Detect Drifts and Retrain Neural Networks on Resource-Limited Edge Devices
    Sunaga, Kazuki
    Yamada, Takeya
    Matsutani, Hiroki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107 (06) : 741 - 750
  • [10] Evolutionary Structure Optimization of Convolutional Neural Networks for Deployment on Resource Limited Systems
    Zhang, Qianyu
    Li, Bin
    Wu, Yi
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 742 - 753