MITA: Multi-Input Adaptive Activation Function for Accurate Binary Neural Network Hardware

被引:0
|
作者
Zhang, Peiqi [1 ]
Takamaeda-yamazaki, Shinya [1 ]
机构
[1] Univ Tokyo, Sch Informat Sci & Technol, Tokyo 1138654, Japan
关键词
binary neural network; adaptive thresholding; algorithm/ hardware co-design; neural network accelerator;
D O I
10.1587/transinf.2023PAP0007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Binary Neural Networks (BNN) have binarized neuron and connection values so that their accelerators can be realized by extremely efficient hardware. However, there is a significant accuracy gap between BNNs and networks with wider bit-width. Conventional BNNs binarize feature maps by static globally-unified thresholds, which makes the produced bipolar image lose local details. This paper proposes a multi -input activation function to enable adaptive thresholding for binarizing feature maps: (a) At the algorithm level, instead of operating each input pixel independently, adaptive thresholding dynamically changes the thresh-old according to surrounding pixels of the target pixel. When optimizing weights, adaptive thresholding is equivalent to an accompanied depth-wise convolution between normal convolution and binarization. Accompanied weights in the depth-wise filters are ternarized and optimized end-to-end. (b) At the hardware level, adaptive thresholding is realized through a multi -input activation function, which is compatible with common accelerator ar-chitectures. Compact activation hardware with only one extra accumulator is devised. By equipping the proposed method on FPGA, 4.1% accuracy improvement is achieved on the original BNN with only 1.1% extra LUT resource. Compared with State-of-the-art methods, the proposed idea further increases network accuracy by 0.8% on the Cifar-10 dataset and 0.4% on the ImageNet dataset.
引用
收藏
页码:2006 / 2014
页数:9
相关论文
共 50 条
  • [31] Model reference adaptive control for multi-input multi-output nonlinear systems using neural networks
    Phuah, JS
    Lu, JM
    Yahagi, T
    PROCEEDINGS OF THE 2003 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM 2003), VOLS 1 AND 2, 2003, : 12 - 16
  • [32] Model reference adaptive control for multi-input multi-output nonlinear systems using neural networks
    Phuah, J
    Lu, JM
    Yahagi, T
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : D303 - D308
  • [33] Adaptive output feedback control of a class of multi-input multi-output systems using neural networks
    Hovakimyan, N
    Calise, AJ
    Kim, N
    INTERNATIONAL JOURNAL OF CONTROL, 2004, 77 (15) : 1318 - 1329
  • [34] Online neural identification of multi-input multi-output systems
    Bazaei, A.
    Moallem, M.
    IET CONTROL THEORY AND APPLICATIONS, 2007, 1 (01): : 44 - 50
  • [35] Nonlinear adaptive trajectory control of multi-input multi-output submarines with input constraints
    Lee, Keum W.
    Singh, Sahjendra N.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2016, 230 (02) : 164 - 183
  • [36] Adaptive Neural Network Finite-Time Control for Multi-Input and Multi-Output Nonlinear Systems With Positive Powers of Odd Rational Numbers
    Li, Yongming
    Li, Kewen
    Tong, Shaocheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2532 - 2543
  • [37] A Multi-Input Neural Network with Dense Flow and Spatio-Temporal Features for Anomaly Detection
    Prabakaran, Anirudh
    Voleti, Mallikaijuna Rao
    Lakshya
    Manurkar, Prathmesh Shirish
    2019 FIFTEENTH INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICINPRO): INTERNET OF THINGS, 2019, : 34 - 39
  • [38] A multi-input lightweight convolutional neural network for breast cancer detection considering infrared thermography
    Tang, Yundong
    Zhou, Depei
    Flesch, Rodolfo C.C.
    Jin, Tao
    Expert Systems with Applications, 2025, 263
  • [39] RAD-BNN: Regulating activation distribution for accurate binary neural network
    Yuan M.
    Pei S.
    Image and Vision Computing, 2024, 148
  • [40] A fault diagnosis model for rolling bearings based on a multi-input layer convolutional neural network
    Zan T.
    Wang H.
    Liu Z.
    Wang M.
    Gao X.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2020, 39 (12): : 142 - 149and163