Towards High Performance and Accurate BNN Inference on FPGA with Structured Fine-grained Pruning

被引:1
|
作者
Fu, Keqi [1 ]
Qi, Zhi [1 ]
Cai, Jiaxuan [1 ]
Shi, Xulong [1 ]
机构
[1] Southeast Univ, Elect Engn Dept, Natl ASIC Syst Engn Res Ctr, Nanjing, Peoples R China
基金
美国国家科学基金会;
关键词
binary neural networks; structured pruning; accelerator architecture; FPGA;
D O I
10.1145/3508352.3549368
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the extreme case of quantization networks, Binary Neural Networks (BNNs) have received tremendous attention due to many hardware-friendly properties in terms of storage and computation. To reach the limit of compact models, we attempt to combine binarization with pruning techniques, further exploring the redundancy of BNNs. However, coarse-grained pruning methods may cause server accuracy drops, while traditional fine-grained ones induce irregular sparsity hard to be utilized by hardware. In this paper, we propose two advanced fine-grained BNN pruning modules, i.e., structured channel-wise kernel pruning and dynamic spatial pruning, from a joint perspective of algorithm and hardware. The pruned BNN models are trained from scratch and present not only a higher precision but also a high degree of parallelism. Then, we develop an accelerator architecture that can effectively exploit the sparsity caused by our algorithm. Finally, we implement the pruned BNN models on an embedded FPGA (Ultra96v2). The results show that our software and hardware codesign achieves 5.4x inference-speedup than the baseline BNN, with higher resource and energy efficiency compared with prior FPGA implemented BNN works.
引用
收藏
页数:9
相关论文
共 50 条
  • [11] Towards More Fine-grained and Reliable NLP Performance Prediction
    Ye, Zihuiwen
    Liu, Pengfei
    Fu, Jinlan
    Neubig, Graham
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3703 - 3714
  • [12] Fine-Grained Entity Typing with Hierarchical Inference
    Ren, Quan
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 2552 - 2558
  • [13] Towards Fine-Grained Recognition: Joint Learning for Object Detection and Fine-Grained Classification
    Wang, Qiaosong
    Rasmussen, Christopher
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT II, 2019, 11845 : 332 - 344
  • [14] Towards Fine-Grained Concept Generation
    Li, Chenguang
    Liang, Jiaqing
    Xiao, Yanghua
    Jiang, Haiyun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 986 - 997
  • [15] Accurate Fine-Grained Processor Power Proxies
    Huang, Wei
    Lefurgy, Charles
    Kuk, William
    Buyuktosunoglu, Alper
    Floyd, Michael
    Rajamani, Karthick
    Allen-Ware, Malcolm
    Brock, Bishop
    2012 IEEE/ACM 45TH INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-45), 2012, : 224 - 234
  • [16] PCNN: Pattern-based Fine-Grained Regular Pruning Towards Optimizing CNN Accelerators
    Tan, Zhanhong
    Song, Jiebo
    Ma, Xiaolong
    Tan, Sia-Huat
    Chen, Hongyang
    Miao, Yuanqing
    Wu, Yifu
    Ye, Shaokai
    Wang, Yanzhi
    Li, Dehui
    Ma, Kaisheng
    PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
  • [17] An FPGA-Based Infrastructure for Fine-Grained DVFS Analysis in High-Performance Embedded Systems
    Mantovani, Paolo
    Cota, Emilio G.
    Tien, Kevin
    Pilato, Christian
    Di Guglielmo, Giuseppe
    Shepard, Ken
    Carloni, Luca P.
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [18] Exploiting fine-grained explicit router feedback towards high-performance transport protocols
    Nakauchi, K
    Kobayashi, K
    INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET , PROCEEDINGS, 2006, : 148 - 155
  • [19] Fine-Grained Channel Pruning for Deep Residual Neural Networks
    Chen, Siang
    Huang, Kai
    Xiong, Dongliang
    Li, Bowen
    Claesen, Luc
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 3 - 14
  • [20] High Speed Video Processing Using Fine-Grained Processing on FPGA Platform
    Ang, Zhi Ping
    Kumar, Akash
    Ha, Yajun
    2013 IEEE 21ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2013, : 85 - 88