Integrated Optimization in Training Process for Binary Neural Network

被引：0

作者：

Quang Hieu Vo ^{[1
]}

Hong, Sang Hoon ^{[2
]}

Kim, Lok-Won ^{[1
]}

Hong, Choong Seon ^{[1
]}

机构：

[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea

[2] Kyung Hee Univ, Dept Elect Engn, Yongin 17104, South Korea

来源：

2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN | 2023年

基金：

新加坡国家研究基金会;

关键词：

Binary Neural Network; Deep Neural Network; Deep Learning; Machine Learning;

D O I：

10.1109/ICOIN56518.2023.10048969

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep Neural Networks (DNNs) have recently become larger and deeper to keep up with more complex applications, resulting in high power and memory consumption. Due to simplicity in computation and storage, Binary Neural Networks (BNNs) have been one of the potential approaches to overcome these challenges. Previous works proposed many techniques to mitigate the accuracy degradation because of less bit-width representation. However, each technique follows different optimization directions, while the combination can gain better results. In addition, the padding value which is an essential factor directly affecting the accuracy and inference implementation has not been touched on in the state-of-the-art solutions. In this paper, based on the previous works, an integrated approach is applied in the training process for BNNs to improve accuracy and training stability. In particular, to increase the probability of changing weights' sign, the ReCU function proposed in related work is used to transform full-precision weight to binary weight, while to make the gradient mismatch of the sign function closer to the real one, the training-aware approximation function is used to replace the sign function. Besides, to make the BNNs compatible with post-XNOR implementation, the padding value for convolution is proposed to change to minus one from the default zero. The integrated method is implemented on the Cifar-10 dataset with VGG-small model shows that the training process is more stable with higher accuracy, compared to the baseline, while the model architecture and training algorithm are preserved.

引用

页码：545 / 548

页数：4

共 50 条

[1] Training Optimization of Feedforward Neural Network for Binary Classification
Thawakar, Omkar
Gajjewar, Pranav
[J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2019), 2019,
[2] Integrated Input Training Neural Network PCA and RBF for Chemical Process Modelling
Geng, Zhiqiang
Wang, Yanqing
Zhang, Yuanyuan
Zhu, Qunxiong
[J]. MATERIALS, MECHATRONICS AND AUTOMATION, PTS 1-3, 2011, 467-469 : 469 - 474
[3] Enabling Binary Neural Network Training on the Edge
Wang, Erwei
Davis, James J.
Moro, Daniele
Zielinski, Piotr
Lim, Jia Jie
Coelho, Claudionor
Chatterjee, Satrajit
Cheung, Peter Y. K.
Constantinides, George A.
[J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (06)
[4] Global optimization for neural network training
Shang, Y
Wah, BW
[J]. COMPUTER, 1996, 29 (03) : 45 - +
[5] Neural network training as a dissipative process
Gori, Marco
Maggini, Marco
Rossi, Alessandro
[J]. NEURAL NETWORKS, 2016, 81 : 72 - 80
[6] A Neural Network Strategy for Process Optimization
Hung, Shiu-Wan
[J]. PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 1037 - 1039
[7] Integrated CS optimization and OLS for recurrent neural network in modeling microwave thermal process
Tong Liu
Shan Liang
Qingyu Xiong
Kai Wang
[J]. Neural Computing and Applications, 2020, 32 : 12267 - 12280
[8] Integrated CS optimization and OLS for recurrent neural network in modeling microwave thermal process
Liu, Tong
Liang, Shan
Xiong, Qingyu
Wang, Kai
[J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16): : 12267 - 12280
[9] Neural Network Training Schemes for Antenna Optimization
Linh Ho Manh
Grimaccia, Francesco
Mussetta, Marco
Zich, Riccardo E.
[J]. 2014 IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM (APSURSI), 2014, : 1948 - 1949
[10] Instance Selection Optimization for Neural Network Training
Kordos, Miroslaw
[J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2016, 2016, 9692 : 610 - 620

← 1 2 3 4 5 →