FPGA-Based Training of Convolutional Neural Networks With a Reduced Precision Floating-Point Library

被引:0
|
作者
DiCecco, Roberto [1 ]
Sun, Lin [1 ]
Chow, Paul [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) have been shown to have high accuracy for classification tasks in numerous applications, which has resulted in their widespread adoption. However, the high accuracy of CNNs comes at the cost of high compute and bandwidth requirements for both classification and training. In this work we discuss an FPGA-based CNN training engine: FCTE, implemented using High-Level Synthesis (HLS), targeting the Xilinx Kintex Ultrascale XCKU115 device. Furthermore, we detail custom-precision floating-point (CPFP) cores for multiplication and addition implemented using HLS, which allows for reduced area utilization. We use these cores with our engine to train networks to demonstrate that an exponent width of 6 and mantissa width of 5 achieves accuracy comparable to single-precision floating-point for the MNIST and CIFAR-10 datasets. These results are achieved using round-to-zero for the CPFP multipliers and round-to-nearest for the CPFP adders, allowing for LUT savings of 32.6% for the multipliers and 21.7% for the adders when compared to half-precision floating-point, while using the same number of DSPs.
引用
收藏
页码:239 / 242
页数:4
相关论文
共 50 条
  • [1] Static Block Floating-Point Quantization for Convolutional Neural Networks on FPGA
    Fan, Hongxiang
    Wang, Gang
    Ferianc, Martin
    Niu, Xinyu
    Luk, Wayne
    [J]. 2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 28 - 35
  • [2] An FPGA-Based Processor for Training Convolutional Neural Networks
    Liu, Zhiqiang
    Dou, Yong
    Jiang, Jingfei
    Wang, Qiang
    Chow, Paul
    [J]. 2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 207 - 210
  • [3] An FPGA-based floating-point Jacobi iterative solver
    Morris, GR
    Prasanna, VK
    [J]. 8TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND NETWORKS, PROCEEDINGS, 2005, : 420 - 427
  • [4] LDPC decoder with a limited-precision FPGA-based floating-point multiplication coprocessor
    Moberly, Raymond
    O'Sullivana, Michael
    Waheed, Khurram
    [J]. ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS XVII, 2007, 6697
  • [5] A floating-point FPGA-based self-tuning regulator
    Salcic, Z
    Cao, JY
    Nguang, SK
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2006, 53 (02) : 693 - 704
  • [6] A Scalable FPGA-based Floating-Point Gaussian Filtering Architecture
    Cuong Pham-Quoc
    Binh Tran-Thanh
    Tran Ngoc Thinh
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2017, : 111 - 116
  • [7] An FPGA-based floating-point processor array supporting a high-precision dot product
    Mayer-Lindenberg, Fritz
    Beller, Valerij
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, PROCEEDINGS, 2006, : 317 - +
  • [8] Higher radix floating-point representations for FPGA-based arithmetic
    Catanzaro, B
    Nelson, B
    [J]. FCCM 2005: 13TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2005, : 161 - 170
  • [9] FPGA-based floating-point datapath design for geometry processing
    Xing, SZ
    Yu, WWH
    [J]. CONFIGURABLE COMPUTING: TECHNOLOGY AND APPLICATIONS, 1998, 3526 : 212 - 217
  • [10] SpCNA: An FPGA-based Accelerator for Point Cloud Convolutional Neural Networks
    Zhou, Gong-Lang
    Guo, Kaiyuan
    Chen, Xiang
    Leung, Kwok Wa
    [J]. 2023 IEEE 31ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM, 2023, : 211 - 211