Structured Binary Neural Networks for Image Recognition

Authors
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
Affiliations
[1] Monash University, Faculty of Information Technology
[2] Zhejiang University, School of Software Engineering
[3] South China University of Technology, Key Laboratory of Big Data and Intelligent Robot
[4] Ministry of Education, School of Computer Science
[5] The University of Adelaide
Source
International Journal of Computer Vision
Keywords
Binary neural networks; Quantization; Image classification; Semantic segmentation; Object detection;
DOI
Not available
Abstract
In this paper, we propose to train binarized convolutional neural networks (CNNs), which are of significant importance for deploying deep learning on mobile devices with limited power capacity and computing resources. Previous works on quantizing CNNs often seek to approximate the floating-point information of weights and/or activations using a set of discrete values. Such methods, termed value approximation here, are typically built on the same network architecture as the full-precision counterpart. Instead, we take a new “structured approximation” view of network quantization: it is possible and valuable to exploit flexible architecture transformation when learning low-bit networks, which can achieve even better performance than the original networks in some cases. In particular, we propose a “group decomposition” strategy, termed GroupNet, which divides a network into desired groups. Interestingly, with our GroupNet strategy, each full-precision group can be effectively reconstructed by aggregating a set of homogeneous binary branches. We also propose to learn effective connections among groups to improve the representation capability. To improve the model capacity, we propose to dynamically execute sparse binary branches conditioned on input features while preserving the computational cost. More importantly, the proposed GroupNet extends flexibly to several vision tasks. For instance, we extend GroupNet to accurate semantic segmentation by embedding rich context into the binary structure. The proposed GroupNet also shows strong performance on object detection. Experiments on image classification, semantic segmentation, and object detection tasks demonstrate the superior performance of the proposed methods over various quantized networks in the literature. Moreover, we analyze the speedup and runtime memory cost on GPU platforms in comparison with related quantization strategies, which serves as a strong benchmark for further research.
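The idea of reconstructing a full-precision group by aggregating homogeneous binary branches can be illustrated with a small sketch. This is not the authors' implementation: the function names (`sign`, `binary_branch`, `group_forward`), the use of a plain matrix-vector product in place of a convolution, the per-row mean-absolute-value scale, and aggregation by simple averaging are all assumptions made for illustration.

```python
import random


def sign(v):
    """Binarize a real value to +1 or -1 (zero maps to +1)."""
    return 1.0 if v >= 0 else -1.0


def binary_branch(x, w):
    """One binary branch: binarize input and weights, then multiply.

    x: list of floats (length n_in); w: list of rows (n_out x n_in).
    The per-row scale (mean absolute weight) is an assumed choice.
    """
    xb = [sign(v) for v in x]
    out = []
    for row in w:
        scale = sum(abs(v) for v in row) / len(row)
        out.append(scale * sum(sign(wv) * xv for wv, xv in zip(row, xb)))
    return out


def group_forward(x, branch_weights):
    """Aggregate K same-shaped binary branches by averaging their outputs."""
    outs = [binary_branch(x, w) for w in branch_weights]
    k = len(outs)
    return [sum(col) / k for col in zip(*outs)]


# Hypothetical sizes: 3 branches, each mapping 8 inputs to 4 outputs.
rng = random.Random(0)
n_in, n_out, k = 8, 4, 3
branches = [
    [[rng.gauss(0, 1) for _ in range(n_in)] for _ in range(n_out)]
    for _ in range(k)
]
x = [rng.gauss(0, 1) for _ in range(n_in)]
y = group_forward(x, branches)
print(len(y))  # one aggregated value per output unit
```

Each branch uses only +1/-1 weights and inputs, so its inner product could in principle be computed with bitwise operations; the group output is a real-valued combination of the branches.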
Pages: 2081-2102
Page count: 21
Citation
Zhuang, Bohan; Shen, Chunhua; Tan, Mingkui; Chen, Peng; Liu, Lingqiao; Reid, Ian. Structured Binary Neural Networks for Image Recognition [J]. International Journal of Computer Vision, 2022, 130(9): 2081-2102.