Towards High Performance and Accurate BNN Inference on FPGA with Structured Fine-grained Pruning

Cited by: 1
Authors
Fu, Keqi [1]
Qi, Zhi [1]
Cai, Jiaxuan [1]
Shi, Xulong [1]
Affiliations
[1] Southeast Univ, Elect Engn Dept, Natl ASIC Syst Engn Res Ctr, Nanjing, Peoples R China
Funding
National Science Foundation (USA);
Keywords
binary neural networks; structured pruning; accelerator architecture; FPGA;
DOI
10.1145/3508352.3549368
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
As the extreme case of quantized networks, Binary Neural Networks (BNNs) have received tremendous attention due to their many hardware-friendly properties in terms of storage and computation. To reach the limit of compact models, we attempt to combine binarization with pruning techniques, further exploring the redundancy of BNNs. However, coarse-grained pruning methods may cause severe accuracy drops, while traditional fine-grained ones induce irregular sparsity that is hard for hardware to exploit. In this paper, we propose two advanced fine-grained BNN pruning modules, i.e., structured channel-wise kernel pruning and dynamic spatial pruning, from a joint perspective of algorithm and hardware. The pruned BNN models are trained from scratch and present not only a higher precision but also a high degree of parallelism. Then, we develop an accelerator architecture that can effectively exploit the sparsity caused by our algorithm. Finally, we implement the pruned BNN models on an embedded FPGA (Ultra96v2). The results show that our software and hardware codesign achieves a 5.4x inference speedup over the baseline BNN, with higher resource and energy efficiency compared with prior FPGA-implemented BNN works.
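As a rough illustration of why binarization is hardware-friendly and why kernel-level sparsity is easy to exploit, the sketch below computes a +1/-1 dot product with XNOR and popcount, then skips whole kernels marked as pruned. It is not the authors' implementation: the function names, the bit-packing convention (bit 1 encodes +1), and the `keep` mask are assumptions made for this example only.

```python
def pack_bits(values):
    """Pack a sequence of +1/-1 values into an int (a set bit encodes +1)."""
    word = 0
    for i, v in enumerate(values):
        if v == 1:
            word |= 1 << i
    return word


def binary_dot(a_word, b_word, n):
    """Dot product of two n-element +1/-1 vectors via XNOR and popcount."""
    mask = (1 << n) - 1
    # XNOR marks the positions where both vectors agree (product is +1).
    matches = bin(~(a_word ^ b_word) & mask).count("1")
    # Agreements contribute +1 each, disagreements -1 each.
    return 2 * matches - n


def pruned_channel_sum(act_words, weight_words, keep, n):
    """Sum per-channel kernel dot products, skipping kernels marked as pruned.

    keep[i] is False when the kernel for input channel i has been pruned
    (hypothetical mask layout, chosen for illustration).
    """
    total = 0
    for a, w, k in zip(act_words, weight_words, keep):
        if k:
            total += binary_dot(a, w, n)
    return total


a = pack_bits([1, -1, 1, 1])
b = pack_bits([1, 1, -1, 1])
neg_a = pack_bits([-1, 1, -1, -1])

dot_ab = binary_dot(a, b, 4)
dense = pruned_channel_sum([a, a], [a, neg_a], [True, True], 4)
sparse = pruned_channel_sum([a, a], [a, neg_a], [True, False], 4)
```

Because a pruned kernel is skipped as a unit, the remaining work stays regular, which is the property that structured pruning aims to preserve for a hardware accelerator.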
Pages: 9