Low Power FPGA-SoC Design Techniques for CNN-based Object Detection Accelerator

被引:0
|
作者
Kim, Heekyung [1 ]
Choi, Ken [1 ]
机构
[1] IIT, Dept Elect & Comp Engn, Chicago, IL 60616 USA
关键词
Low Power CNN; Power Efficient; Power Efficient FIFO; Low Power Techniques; CNN-based Accelerator; FPGA design; FPGA SoC Design; Xilinx FPGA-SoC;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper shows the possibility of the existing low power register transfer level (RTL) techniques can be effective as a low power design scheme for CNN-based object recognition system acceleration in contrast to conventional techniques. Most power efficient design techniques regarding CNN acceleration are focused on the High-level Synthesis (HLS) aspect, such as memory bandwidth optimization, network architecture reconfiguration, data reuse, and batch normalization. However, these attempts have reached the limits of the effectiveness of itself. Using the postsynthesis RTL code generated by field-programmable gate arrays (FPGA) manufacturers, the proposed RTL low power design technique was applied to the original FIFO part for reducing the power consumption during data transformation. We compared the HLS optimized result with the RTL optimized result in the aspect of power consumption. We configured the testbench for the modified FIFO module and analyzed the estimated power dissipation result. These power effectiveness factors, such as a look-up table (LUT), a lookup table RAM (LUTRAM), can reduce the power dissipation by 54%, 49% respectively, even though increased block RAM (BRAM) leads to the elevated power dissipation by 154%. Thus, the total power consumption was able to be decreased by 10%. This paper discusses two factors of FPGA with system-on- chip (FPGA-SoC) design for CNN-based hardware implementation in power consumption aspect, such as RTL architecture, memory design architecture, and the model architecture-based hardware implementation methods. The virtual additional memory can support the high throughput at full speed. Our simulated low power schemes applied to Processing System (PS) and Programmable Logic (PL) architecture effectively reduced the power consumption by 25.9% in the FIFO data transformation. We established that the increased LUT blocks affect the power-efficient rate and reduce the power consumption of the PL design up to 49%.
引用
收藏
页码:1130 / 1134
页数:5
相关论文
共 50 条
  • [1] High Power-Efficient and Performance-Density FPGA Accelerator for CNN-Based Object Detection
    Zhang, Gang
    Zhang, Chaofan
    Wang, Fan
    Tang, Fulin
    Wu, Yihong
    Yang, Xuezhi
    Liu, Yong
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 117 - 128
  • [2] A Reconfigurable CNN-Based Accelerator Design for Fast and Energy-Efficient Object Detection System on Mobile FPGA
    Kim, Victoria Heekyung
    Choi, Kyuwon Ken
    [J]. IEEE ACCESS, 2023, 11 : 59438 - 59445
  • [3] The Implementation of a Power Efficient BCNN-based Object Detection Acceleration on a Xilinx FPGA-SoC
    Kim, Heekyung
    Choi, Ken
    [J]. 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 240 - 243
  • [4] A Configurable Accelerator for CNN-Based Remote Sensing Object Detection on FPGAs
    Shao, Yingzhao
    Shang, Jincheng
    Li, Yunsong
    Ding, Yueli
    Zhang, Mingming
    Ren, Ke
    Liu, Yang
    [J]. IET COMPUTERS AND DIGITAL TECHNIQUES, 2024, 2024
  • [5] Design of Embedded Vision System based on FPGA-SoC
    Alsheikhy, Ahmed
    Said, Yahia Fahem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (10) : 91 - 98
  • [6] FPGA Implementation for CNN-Based Optical Remote Sensing Object Detection
    Zhang, Ning
    Wei, Xin
    Chen, He
    Liu, Wenchao
    [J]. ELECTRONICS, 2021, 10 (03) : 1 - 24
  • [7] Optimizing CNN-Based Object Detection Algorithms on Embedded FPGA Platforms
    Zhao, Ruizhe
    Niu, Xinyu
    Wu, Yajie
    Luk, Wayne
    Liu, Qiang
    [J]. APPLIED RECONFIGURABLE COMPUTING, 2017, 10216 : 255 - 267
  • [8] FPGA-SoC implementation of YOLOv4 for flying-object detection
    Nguyen, Dai-Duong
    Nguyen, Dang-Tuan
    Le, Minh-Thuy
    Nguyen, Quoc-Cuong
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
  • [9] Deep CNN Co-design for HEVC CU Partition Prediction on FPGA-SoC
    Bouaafia, Soulef
    Khemiri, Randa
    Messaoud, Seifeddine
    Sayadi, Fatma Ezahra
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (04) : 3283 - 3301
  • [10] Design of an Efficient CNN-Based Cough Detection System on Lightweight FPGA
    Peng, Peng
    Jiang, Kai
    You, Mingyu
    Xie, Jialin
    Zhou, Hongjun
    Xu, Weisheng
    Lu, Jicheng
    Li, Xiayu
    Xu, Yun
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2023, 17 (01) : 116 - 128