FPGA Accelerated Deep Learning for Industrial and Engineering Applications: Optimal Design Under Resource Constraints

被引:0
|
作者
Liu, Yanyi [1 ]
Du, Hang [1 ]
Wu, Yin [1 ]
Mo, Tianli [1 ]
机构
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing 210037, Peoples R China
来源
ELECTRONICS | 2025年 / 14卷 / 04期
关键词
object detection; YOLOv4-Tiny; refined resource management strategy; predefined interface latency; dynamic bit width tuning quantization;
D O I
10.3390/electronics14040703
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In response to the need for deploying the YOLOv4-Tiny model on resource-constrained Field-Programmable Gate Array (FPGA) platforms for rapid inference, this study proposes a general optimization acceleration strategy and method aimed at achieving fast inference for object detection networks. This approach centers on the synergistic effect of several key strategies: a refined resource management strategy that dynamically adjusts FPGA hardware resource allocation based on the network architecture; a dynamic dual-buffering strategy that maximizes the parallelism of data computation and transmission; an interface access latency pre-configuration strategy that effectively improves data throughput; and quantization operations for dynamic bit width tuning of model parameters and cached variables. Experimental results on the ZYNQ7020 platform demonstrate that this accelerator operates at a frequency of 200 MHz, achieving an average computing performance of 36.97 Giga Operations Per Second (GOPS) with an energy efficiency of 8.82 Giga Operations Per Second per Watt (GOPS/W). Testing with a metal surface defect dataset maintains an accuracy of approximately 90% per image, while reducing the inference delay per frame to 185 ms, representing a 52.2% improvement in inference speed. Compared to other FPGA accelerator designs, the accelerator design strategies and methods proposed in this study showcase significant enhancements in average computing performance, energy efficiency, and inference latency.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Improving deep learning model performance under parametric constraints for materials informatics applications
    Gupta, Vishu
    Peltekian, Alec
    Liao, Wei-keng
    Choudhary, Alok
    Agrawal, Ankit
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [22] Optimal reinsurance design under solvency constraints
    Avanzi, Benjamin
    Lau, Hayden
    Steffensen, Mogens
    SCANDINAVIAN ACTUARIAL JOURNAL, 2024, 2024 (04) : 383 - 416
  • [23] Improving deep learning model performance under parametric constraints for materials informatics applications
    Vishu Gupta
    Alec Peltekian
    Wei-keng Liao
    Alok Choudhary
    Ankit Agrawal
    Scientific Reports, 13
  • [24] Creativity under Constraints: The Affect of Problem Space on Design Learning among Engineering Students
    Goncher, Andrea
    C & C 09: PROCEEDINGS OF THE 2009 ACM SIGCHI CONFERENCE ON CREATIVITY AND COGNITION, 2009, : 327 - 327
  • [25] Machine learning for optimal test admission in the presence of resource constraints
    Ramy Elitzur
    Dmitry Krass
    Eyal Zimlichman
    Health Care Management Science, 2023, 26 : 279 - 300
  • [26] Machine learning for optimal test admission in the presence of resource constraints
    Elitzur, Ramy
    Krass, Dmitry
    Zimlichman, Eyal
    HEALTH CARE MANAGEMENT SCIENCE, 2023, 26 (02) : 279 - 300
  • [27] Optimal information storage in noisy synapses under resource constraints
    Varshney, Lav R.
    Sjostrom, Per Jesper
    Chklovskii, Dmitri B.
    NEURON, 2006, 52 (03) : 409 - 423
  • [28] The Pandemic Shift to Remote Learning under Resource Constraints
    Ravi P.
    Ismail A.
    Kumar N.
    Proceedings of the ACM on Human-Computer Interaction, 2021, 5 (CSCW2):
  • [29] ALGORITHM FOR OPTIMAL PROJECT SCHEDULING UNDER MULTIPLE RESOURCE CONSTRAINTS
    DAVIS, EW
    HEIDORN, GE
    MANAGEMENT SCIENCE SERIES B-APPLICATION, 1971, 17 (12): : B803 - B816
  • [30] Optimal allocation of minimal & perfect repairs under resource constraints
    Cui, LR
    Kuo, W
    Loh, HT
    Xie, A
    IEEE TRANSACTIONS ON RELIABILITY, 2004, 53 (02) : 193 - 199