Towards High Performance and Accurate BNN Inference on FPGA with Structured Fine-grained Pruning

被引:1
|
作者
Fu, Keqi [1 ]
Qi, Zhi [1 ]
Cai, Jiaxuan [1 ]
Shi, Xulong [1 ]
机构
[1] Southeast Univ, Elect Engn Dept, Natl ASIC Syst Engn Res Ctr, Nanjing, Peoples R China
基金
美国国家科学基金会;
关键词
binary neural networks; structured pruning; accelerator architecture; FPGA;
D O I
10.1145/3508352.3549368
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the extreme case of quantization networks, Binary Neural Networks (BNNs) have received tremendous attention due to many hardware-friendly properties in terms of storage and computation. To reach the limit of compact models, we attempt to combine binarization with pruning techniques, further exploring the redundancy of BNNs. However, coarse-grained pruning methods may cause server accuracy drops, while traditional fine-grained ones induce irregular sparsity hard to be utilized by hardware. In this paper, we propose two advanced fine-grained BNN pruning modules, i.e., structured channel-wise kernel pruning and dynamic spatial pruning, from a joint perspective of algorithm and hardware. The pruned BNN models are trained from scratch and present not only a higher precision but also a high degree of parallelism. Then, we develop an accelerator architecture that can effectively exploit the sparsity caused by our algorithm. Finally, we implement the pruned BNN models on an embedded FPGA (Ultra96v2). The results show that our software and hardware codesign achieves 5.4x inference-speedup than the baseline BNN, with higher resource and energy efficiency compared with prior FPGA implemented BNN works.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Towards Fine-Grained Verification of Application Mobility
    Zhou, Yu
    Huang, Yankai
    Ge, Jidong
    Hu, Jun
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013 WORKSHOPS, 2014, 8182 : 75 - 83
  • [32] THERMAL PERFORMANCE OF FINE-GRAINED SOILS
    SALOMONE, LA
    KOVACS, WD
    KUSUDA, T
    JOURNAL OF GEOTECHNICAL ENGINEERING-ASCE, 1984, 110 (03): : 359 - 374
  • [33] A fine-grained Ethernet performance model
    Schneidewind, NF
    TELECOMMUNICATION SYSTEMS, 1996, 6 (01) : 77 - 90
  • [34] Towards Fine-Grained Webpage Fingerprinting at Scale
    Zhao, Xiyuan
    Deng, Xinhao
    Li, Qi
    Liu, Yunpeng
    Liu, Zhuotao
    Sun, Kun
    Xu, Ke
    PROCEEDINGS OF THE 2024 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2024, 2024, : 423 - 436
  • [35] Spreading Fine-Grained Prior Knowledge for Accurate Tracking
    Nie, Jiahao
    Wu, Han
    He, Zhiwei
    Gao, Mingyu
    Dong, Zhekang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6186 - 6199
  • [36] A Method for Accurate High-Level Performance Evaluation of MPSoC Architectures Using Fine-Grained Generated Traces
    Plyaskin, Roman
    Herkersdorf, Andreas
    ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2010, PROCEEDINGS, 2010, 5974 : 199 - 210
  • [37] Learning fine-grained search space pruning and heuristics for combinatorial optimization
    Juho Lauri
    Sourav Dutta
    Marco Grassia
    Deepak Ajwani
    Journal of Heuristics, 2023, 29 : 313 - 347
  • [38] Learning fine-grained search space pruning and heuristics for combinatorial optimization
    Lauri, Juho
    Dutta, Sourav
    Grassia, Marco
    Ajwani, Deepak
    JOURNAL OF HEURISTICS, 2023, 29 (2-3) : 313 - 347
  • [39] Nimblock: Scheduling for Fine-grained FPGA Sharing through Virtualization
    Mandava, Meghna
    Reckamp, Paul
    Chen, Deming
    PROCEEDINGS OF THE 2023 THE 50TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, ISCA 2023, 2023, : 843 - 855
  • [40] A Heterogeneous Framework to Accelerate CNNs with Fine-Grained FPGA Management
    Guo K.-C.
    Wu C.-G.
    Zhang W.-F.
    Qi Z.-W.
    Guan H.-B.
    1600, Science Press (44): : 2529 - 2541