Layer-wise learning based stochastic gradient descent method for the optimization of deep convolutional neural network

Cited by: 45

Authors:

Zheng, Qinghe [1]

Tian, Xinyu [2]

Jiang, Nan [3]

Yang, Mingqiang [1]

Affiliations:

[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Shandong, Peoples R China

[2] Shandong Management Univ, Coll Mech & Elect Engn, Jinan, Shandong, Peoples R China

[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan, Hubei, Peoples R China

Source:

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2019 / Vol. 37 / No. 04

Funding:

National Key Research and Development Program of China; National Natural Science Foundation of China;

Keywords:

Deep learning; deep CNNs; non-convex optimization; SGD; layer-wise learning; FUZZY; ALGORITHM;

DOI:

10.3233/JIFS-190861

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Despite the popularity of deep convolutional neural networks (CNNs), training network models efficiently remains challenging for several reasons. In this paper, we present a simple and computationally efficient layer-wise learning based stochastic gradient descent method (LLb-SGD) for gradient-based optimization of objective functions in deep learning. By simulating the cross-media propagation mechanism of light in the natural environment, we set an adaptive learning rate for each layer of the neural network. To find a proper local optimum quickly, the dynamic learning sequence spanning different layers adaptively adjusts the descent speed of the objective function in a multi-scale and multi-dimensional environment. To the best of our knowledge, this is the first attempt to introduce an adaptive layer-wise learning schedule with a certain degree of convergence guarantee. Owing to its generality and robustness, the method is insensitive to hyper-parameters and can therefore be applied to various network architectures and datasets. Finally, we show promising results compared to other optimization methods on two image classification benchmarks using five standard networks.
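
The abstract does not state the LLb-SGD update rule, so the following is only a minimal sketch of the general idea it describes: plain SGD in which each layer has its own learning rate. The function name `sgd_layerwise_step`, the layer names, the toy objective, and the learning-rate values are all assumptions made for illustration; the fixed per-layer rates used here stand in for the adaptive, light-propagation-inspired schedule of the paper, which is not reproduced.

```python
def sgd_layerwise_step(params, grads, layer_lrs):
    """One SGD step with a separate learning rate per layer.

    params, grads: dict mapping layer name -> list of floats
    layer_lrs: dict mapping layer name -> learning rate (hypothetical values)
    """
    return {
        name: [w - layer_lrs[name] * g for w, g in zip(weights, grads[name])]
        for name, weights in params.items()
    }


# Toy two-"layer" example: minimize the sum of squared weights,
# so the gradient of each weight w is 2 * w.
params = {"conv1": [1.0, -2.0], "fc": [0.5]}
layer_lrs = {"conv1": 0.1, "fc": 0.25}  # illustrative values, not from the paper

for _ in range(3):
    grads = {name: [2.0 * w for w in weights] for name, weights in params.items()}
    params = sgd_layerwise_step(params, grads, layer_lrs)
```

In this toy setting each step multiplies a weight by (1 - 2 * lr), so the layer with the larger rate shrinks toward the optimum faster. An adaptive layer-wise method would recompute `layer_lrs` during training instead of holding it fixed.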

Pages: 5641-5654

Number of pages: 14

