FPGA-Based Depth Separable Convolution Neural Network

被引:1
|
作者
Lai, Yeong-Kang [1 ]
Hwang, Yu-Hao [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung, Taiwan
关键词
D O I
10.1109/icce46568.2020.9043044
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In order to enable convolution neural network (CNN) to be deployed on a Field Programmable Gate Array (FPGA), this study builds a lightweight convolutional neural network that can be separated by a depth to reduce the amount of parameters and computations stored. We replaced the standard convolution operation with a separate convolution operation, and proposed a hardware accelerator architecture that can handle differently sized depth-separable convolution operations, using parallelization to efficiently utilize hardware resources for depth separable convolution. Therefore, data can be reused to reduce number of memory accesses. This hardware accelerator can achieve 588 frames per second and 37.88M ops/sec throughput at 100MHz clock.
引用
收藏
页码:741 / 742
页数:2
相关论文
共 50 条
  • [21] An Efficient FPGA-Based Accelerator Design for Convolution
    Song, Peng-Fei
    Pan, Jeng-Shyang
    Yang, Chun-Sheng
    Lee, Chiou-Yng
    [J]. 2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 494 - 500
  • [22] An FPGA-based Accelerator Platform Implements for Convolutional Neural Network
    Meng, Xiao
    Yu, Lixin
    Qin, Zhiyong
    [J]. 2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 25 - 28
  • [23] [DL] A Survey of FPGA-based Neural Network Inference Accelerators
    Guo, Kaiyuan
    Zeng, Shulin
    Yu, Jincheng
    Wang, Yu
    Yang, Huazhong
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2019, 12 (01)
  • [24] Implementing Binarized Neural Network Processor on FPGA-Based Platform
    Lee, Jeahack
    Kim, Hyeonseong
    Kim, Byung-Soo
    Jeon, Seokhun
    Lee, Jung Chul
    Kim, Dong Sun
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 469 - 471
  • [25] FPGA-based system for artificial neural network arrhythmia classification
    Zairi, Hadjer
    Talha, Malika Kedir
    Meddah, Karim
    Slimane, Saliha Ould
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4105 - 4120
  • [26] A networked FPGA-based hardware implementation of a neural network application
    Restrepo, HF
    Hoffmann, R
    Perez-Uribe, A
    Teuscher, C
    Sanchez, E
    [J]. 2000 IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2000, : 337 - 338
  • [27] FPGA based convolution and memory architecture for Convolutional Neural Network
    Shahan, K. A.
    Rani, Sheeba J.
    [J]. 2020 33RD INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2020 19TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2020, : 183 - 188
  • [28] Neural network processor for a FPGA-based multiband fluorometer device
    Tabari, Karima
    Boukadoum, Mounir
    Bensaoula, Abdelhak
    Starikov, David
    [J]. 2006 INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION AND SENSING, 2006, : 198 - +
  • [29] FPGA-based system for artificial neural network arrhythmia classification
    Hadjer Zairi
    Malika Kedir Talha
    Karim Meddah
    Saliha Ould Slimane
    [J]. Neural Computing and Applications, 2020, 32 : 4105 - 4120
  • [30] Throughput optimizations for FPGA-based deep neural network inference
    Posewsky, Thorbjoern
    Ziener, Daniel
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2018, 60 : 151 - 161