Energy-Efficient and High-Throughput FPGA-based Accelerator for Convolutional Neural Networks

被引:0
|
作者
Feng, Gan [1 ]
Hu, Zuyi [1 ]
Chen, Song [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230046, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
FPGA; CNN; accelerator; energy-efficient; LeNet-5;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNN) is widely applied in modern machine learning and pattern recognition area. Not only performance, more and more attention is paid on energy efficienct and scalable devices like FPGA as a better solution than CPU and GPU. In this paper, we propose methods to optimize CNN by fixed-point quantization, activation function approximation, loops and tasks pipelining and parallelization, memory reorganization, and implement an energy-efficient and high-throughput FPGA-based CNN accelerator for LeNet-5 based on Zynq-7000 platform. The accelerator can run at 166MHz and achieve a low error rate of 0.99%, the same as software implementations, and has 37% higher throughput and 93.7% less energy dissipation than a GPU implementation.
引用
收藏
页码:624 / 626
页数:3
相关论文
共 50 条
  • [1] An Uninterrupted Processing Technique-Based High-Throughput and Energy-Efficient Hardware Accelerator for Convolutional Neural Networks
    Islam, Md Najrul
    Shrestha, Rahul
    Chowdhury, Shubhajit Roy
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (12) : 1891 - 1901
  • [2] EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks
    Liang, Shengwen
    Wang, Ying
    Liu, Cheng
    He, Lei
    Li, Huawei
    Xu, Dawen
    Li, Xiaowei
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (09) : 1511 - 1525
  • [3] An FPGA-Based Energy-Efficient Reconfigurable Convolutional Neural Network Accelerator for Object Recognition Applications
    Li, Jixuan
    Un, Ka-Fai
    Yu, Wei-Han
    Mak, Pui-In
    Martins, Rui P.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (09) : 3143 - 3147
  • [4] An Energy-Efficient FPGA-based Convolutional Neural Network Implementation
    Irmak, Hasan
    Alachiotis, Nikolaos
    Ziener, Daniel
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [5] Optimization of Energy Efficiency for FPGA-Based Convolutional Neural Networks Accelerator
    Tang, Yongming
    Dai, Rongshi
    Xie, Yi
    [J]. 2020 4TH INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2020), 2020, 1487
  • [6] An FPGA-Based High-Throughput Dataflow Accelerator for Lightweight Neural Network
    Zhao, Zhiyuan
    Li, Jixing
    Chen, Gang
    Jiang, Zhelong
    Qiao, Ruixiu
    Xu, Peng
    Chen, Yihao
    Lu, Huaxiang
    [J]. 2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [7] Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights
    Duan, Yunzhi
    Li, Shuai
    Zhang, Ruipeng
    Wang, Qi
    Chen, Jienan
    Sobelman, Gerald E.
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [8] A 0.59μJ/pixel High-throughput Energy-efficient Neural Volume Rendering Accelerator on FPGA
    Yuan, Zhechen
    Yuan, Binzhe
    Gu, Yuhan
    Zheng, Yueyang
    He, Yunxiang
    Wang, Xuexin
    Rao, Chaolin
    Zhou, Pingqiang
    Yu, Jingyi
    Lou, Xin
    [J]. 2024 IEEE CUSTOM INTEGRATED CIRCUITS CONFERENCE, CICC, 2024,
  • [9] High Energy Efficiency FPGA-based Accelerator for Convolutional Neural Networks Using Weight Combination
    Shu, Chenghao
    Pang, Wei
    Liu, Hao
    Lu, Shengli
    [J]. 2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 578 - 582
  • [10] An FPGA-Based High-Throughput Keypoint Detection Accelerator Using Convolutional Neural Network for Mobile Robot Applications
    Li, Jingyuan
    Liu, Ye
    Huang, Kun
    Zhou, Liang
    Chang, Liang
    Zhou, Jun
    [J]. 2022 IEEE ASIA PACIFIC CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS, PRIMEASIA, 2022, : 81 - 84