Optimizing Deep Learning Acceleration on FPGA for Real-Time and Resource-Efficient Image Classification

被引:2
|
作者
Khaki, Ahmad Mouri Zadeh [1 ]
Choi, Ahyoung [1 ]
机构
[1] Gachon Univ, Dept AI & Software, Seongnam Si 13120, South Korea
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 01期
关键词
AI hardware acceleration; convolutional neural network (CNN); deep learning; field-programmable gate array (FPGA); transfer learning; TO-DIGITAL CONVERTER; DESIGN; IMPLEMENTATION; EYE; CNN;
D O I
10.3390/app15010422
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Deep learning (DL) has revolutionized image classification, yet deploying convolutional neural networks (CNNs) on edge devices for real-time applications remains a significant challenge due to constraints in computation, memory, and power efficiency. This work presents an optimized implementation of VGG16 and VGG19, two widely used CNN architectures, for classifying the CIFAR-10 dataset using transfer learning on field-programmable gate arrays (FPGAs). Utilizing the Xilinx Vitis-AI and TensorFlow2 frameworks, we adapt VGG16 and VGG19 for FPGA deployment through quantization, compression, and hardware-specific optimizations. Our implementation achieves high classification accuracy, with Top-1 accuracy of 89.54% and 87.47% for VGG16 and VGG19, respectively, while delivering significant reductions in inference latency (7.29x and 6.6x compared to CPU-based alternatives). These results highlight the suitability of our approach for resource-efficient, real-time edge applications. Key contributions include a detailed methodology for combining transfer learning with FPGA acceleration, an analysis of hardware resource utilization, and performance benchmarks. This work underscores the potential of FPGA-based solutions to enable scalable, low-latency DL deployments in domains such as autonomous systems, IoT, and mobile devices.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] ResCoNN: Resource-Efficient FPGA-Accelerated CNN for Traffic Sign Classification
    Lechner, Martin
    Jantsch, Axel
    Dinakarrao, Sai Manoj Pudukotai
    2019 TENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2019,
  • [32] Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification
    Faizan, Muhammad
    Intzes, Ioannis
    Cretu, Ioana
    Meng, Hongying
    TECHNOLOGIES, 2023, 11 (04)
  • [33] Resource-Efficient Deep Learning: Fast Hand Gestures on Microcontrollers
    Mach, Tuan Kiet Tran
    Van, Khai Nguyen
    Le, Minhhuy
    EAI Endorsed Transactions on Industrial Networks and Intelligent Systems, 2024, 11 (03) : 1 - 11
  • [34] A new real-time resource-efficient algorithm for ECG denoising, feature extraction and classification-based wearable sensor network
    Marhoon, Ali Fadel
    Hamad, Ali Hussein
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2015, 18 (02) : 103 - 114
  • [35] Research of a resource-efficient, real-time and fault-tolerant wireless sensor network system
    Liu, Xing
    Zhou, Haiying
    Xiong, Shengwu
    Hou, Kun Mean
    De Vaulx, Christophe
    Shi, Hongling
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2016, 31 : 3 - 13
  • [36] HybriDC: A Resource-Efficient CPU-FPGA Heterogeneous Acceleration System for Lossless Data Compression
    Liu, Puguang
    Wei, Ziling
    Yu, Chuan
    Chen, Shuhui
    MICROMACHINES, 2022, 13 (11)
  • [37] FPGA-Based Parallel Hardware Architecture for Real-Time Image Classification
    Qasaimeh, Murad
    Sagahyroon, Assim
    Shanableh, Tamer
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2015, 1 (01) : 56 - 70
  • [38] Efficient FPGA Implementation of Multilayer Perceptron for Real-Time Human Activity Classification
    Gaikwad, Nikhil B.
    Tiwari, Varun
    Keskar, Avinash
    Shivaprakash, N. C.
    IEEE ACCESS, 2019, 7 : 26696 - 26706
  • [39] Semantic-Based Optimization of Deep Learning for Efficient Real-Time Medical Image Segmentation
    Wei, Zhenkun
    Liu, Jia
    Yao, Yu
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2024, 20 (01)
  • [40] FPGA Acceleration of a Supervised Learning Method for Hyperspectral Image Classification
    Tajiri, Kento
    Maruyama, Tsutomu
    2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 273 - 276