A Reconfigurable DNN Training Accelerator on FPGA

被引:3
|
作者
Lu, Jinming [1 ]
Lin, Jun [1 ]
Wang, Zhongfeng [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural networks; hardware accelerator; training; batch normalization; FPGA;
D O I
10.1109/sips50750.2020.9195234
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, deep neural networks (DNNs) have been widely applied in various tasks, demonstrating outstanding performance. To further outspread in practical applications, the efficient hardware implementation of DNNs is becoming a critical issue. With the rise of online learning, training DNNs on resource-constrained platforms has attracted more attention most recently. In this paper, we propose an FPGA-based accelerator for efficient DNN training. First, a reconfigurable processing element is designed, which is flexible to support various computation patterns during training in a unified architecture. Second, a well optimized architecture is presented to perform the computation of batch normalization layers in different stages. Finally, a prevailing model (ResNet-20) for CIFAR-10 dataset is implemented on Xilinx VC706 platform with our framework. Experimental results show that our design achieves 421 GOPS and 43.18 GOPS/W in terms of throughput and energy efficiency, respectively. The comparison results illustrate that our accelerator significantly outperforms prior works.
引用
收藏
页码:94 / 99
页数:6
相关论文
共 50 条
  • [1] An FPGA-Based Reconfigurable Accelerator for Low-Bit DNN Training
    Shao, Haikuo
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    [J]. 2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 254 - 259
  • [2] A Reconfigurable Accelerator for Generative Adversarial Network Training Based on FPGA
    Yin, Tongtong
    Mao, Wendong
    Lu, Jinming
    Wang, Zhongfeng
    [J]. 2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 144 - 149
  • [3] An FPGA-Based Reconfigurable CNN Training Accelerator Using Decomposable Winograd
    Wang, Hui
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    [J]. 2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 175 - 180
  • [4] Acceleration of DNN Training Regularization: Dropout Accelerator
    Lee, Gunhee
    Park, Hanmin
    Ryu, Soojung
    Lee, Hyuk-Jae
    [J]. 2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [5] Towards Reconfigurable CNN Accelerator for FPGA Implementation
    Syed, Rizwan Tariq
    Andjelkovic, Marko
    Ulbricht, Markus
    Krstic, Milos
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (03) : 1249 - 1253
  • [6] Voltage Scaled Low Power DNN Accelerator Design on Reconfigurable Platform
    Paul, Rourab
    Sarkar, Sreetama
    Sau, Suman
    Roy, Sanghamitra
    Chakraborty, Koushik
    Chakrabarti, Amlan
    [J]. ELECTRONICS, 2024, 13 (08)
  • [7] Reconfigurable android with an FPGA accelerator for the future embedded devices
    Nakajo, Hironori
    Koike, Keisuke
    Ohta, Atsushi
    Ohshima, Kohta
    Fujinami, Kaori
    [J]. Proceedings - 2011 2nd International Conference on Networking and Computing, ICNC 2011, 2011, : 173 - 178
  • [8] An Approach Towards Distributed DNN Training on FPGA Clusters
    Kreowsky, Philipp
    Knapheide, Justin
    Stabernack, Benno
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS, ARCS 2024, 2024, 14842 : 18 - 32
  • [9] Hardware Accelerator Design for Sparse DNN Inference and Training: A Tutorial
    Mao, Wendong
    Wang, Meiqi
    Xie, Xiaoru
    Wu, Xiao
    Wang, Zhongfeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (03) : 1708 - 1714
  • [10] Securing the Cloud with Reconfigurable Computing: An FPGA Accelerator for Homomorphic Encryption
    Cilardo, Alessandro
    Argenziano, Domenico
    [J]. PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1622 - 1627