Integrated Optimization in Training Process for Binary Neural Network

Authors
Quang Hieu Vo [1]
Hong, Sang Hoon [2]
Kim, Lok-Won [1]
Hong, Choong Seon [1]
Affiliations
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea
[2] Kyung Hee Univ, Dept Elect Engn, Yongin 17104, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Binary Neural Network; Deep Neural Network; Deep Learning; Machine Learning;
DOI
10.1109/ICOIN56518.2023.10048969
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Deep Neural Networks (DNNs) have recently become larger and deeper to keep up with more complex applications, resulting in high power and memory consumption. Because of their simplicity in computation and storage, Binary Neural Networks (BNNs) are one of the potential approaches to overcoming these challenges. Previous works have proposed many techniques to mitigate the accuracy degradation caused by the reduced bit-width representation. However, each technique follows a different optimization direction, whereas combining them can yield better results. In addition, the padding value, an essential factor that directly affects accuracy and inference implementation, has not been addressed in state-of-the-art solutions. In this paper, building on previous works, an integrated approach is applied to the training process of BNNs to improve accuracy and training stability. In particular, to increase the probability that weights change sign, the ReCU function proposed in related work is used to transform full-precision weights into binary weights, while, to reduce the gradient mismatch with the sign function, a training-aware approximation function is used in place of the sign function. Besides, to make BNNs compatible with post-XNOR implementation, the padding value for convolution is changed from the default zero to minus one. The integrated method, implemented on the CIFAR-10 dataset with the VGG-small model, shows that the training process is more stable and reaches higher accuracy than the baseline, while the model architecture and training algorithm are preserved.
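The abstract's point about the padding value can be illustrated with a small sketch. It is not the authors' implementation: it assumes the common encoding of +1 as bit 1 and -1 as bit 0 (the abstract does not state an encoding), uses a 1-D convolution for brevity, and all function names are hypothetical. Under that assumption, a dot product of two +1/-1 vectors of length n equals 2 * popcount(XNOR(a, b)) - n, so a padding value of -1 fits into the bit representation while a padding value of 0 does not.

```python
# Illustrative sketch only (not the paper's code): pure-Python 1-D binary
# convolution on +1/-1 values.
# Assumed bit encoding: +1 -> bit 1, -1 -> bit 0.

def to_bits(values):
    """Pack a list of +1/-1 values into an int, one bit per value."""
    bits = 0
    for i, v in enumerate(values):
        if v not in (1, -1):
            raise ValueError("only +1/-1 can be encoded as a single bit")
        if v == 1:
            bits |= 1 << i
    return bits


def xnor_popcount_dot(a, b):
    """Dot product of two +1/-1 vectors via XNOR and popcount."""
    n = len(a)
    mask = (1 << n) - 1
    agree = ~(to_bits(a) ^ to_bits(b)) & mask  # XNOR: 1 where signs match
    return 2 * bin(agree).count("1") - n


def arithmetic_dot(a, b):
    """Reference dot product using ordinary integer arithmetic."""
    return sum(x * y for x, y in zip(a, b))


def conv1d(signal, kernel, pad_value, dot):
    """Same-size 1-D cross-correlation with constant padding."""
    p = len(kernel) // 2
    padded = [pad_value] * p + list(signal) + [pad_value] * p
    return [dot(padded[i:i + len(kernel)], kernel) for i in range(len(signal))]


signal = [1, -1, -1, 1, 1]
kernel = [1, 1, -1]

# With -1 padding, the XNOR/popcount path reproduces the arithmetic result.
ref_minus_one = conv1d(signal, kernel, -1, arithmetic_dot)
xnor_minus_one = conv1d(signal, kernel, -1, xnor_popcount_dot)

# With 0 padding, the arithmetic result differs at the borders, and the
# padded zeros cannot be encoded as a single bit at all.
ref_zero = conv1d(signal, kernel, 0, arithmetic_dot)
```

In this sketch, `ref_minus_one` and `xnor_minus_one` agree at every position, whereas `ref_zero` differs from them at the two border positions. With a kernel of odd length, the XNOR/popcount formula can only produce odd values, so the even border values from zero padding would need separate handling outside the XNOR path.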
Pages: 545-548
Number of pages: 4