Layer-Wise Training to Create Efficient Convolutional Neural Networks

Cited by: 1

Authors:

Zeng, Linghua [1]

Tian, Xinmei [1]

Affiliations:

[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China

Source:

NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II | 2017 / Vol. 10635

Keywords:

Deep learning; Network compression; Layer-wise training;

DOI:

10.1007/978-3-319-70096-0_65

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent large CNNs have delivered impressive performance, but their storage requirements and computational cost limit their use on mobile devices and in large-scale Internet services. Work on storage compression has achieved great success, and more recently the reduction of computational cost has drawn increasing attention. In this paper, we propose an algorithm to reduce computational cost, a problem usually addressed by sparsification and matrix decomposition methods. Since computation is dominated by the convolutional operations, we focus on compressing the convolutional layers. Unlike sparsification and matrix decomposition methods, which usually derive from mathematics, we take inspiration from transfer learning and biological neural networks. We transfer the knowledge in state-of-the-art large networks to compressed small ones via layer-wise training: we replace the complex convolutional layers in large networks with more efficient modules while keeping the output of each layer consistent. The design of these efficient modules draws on biological neural networks. For the AlexNet model, we achieve a 3.62x speedup with a 0.11% increase in top-5 error rate. For the VGG model, we achieve a 5.67x speedup with a 0.43% increase in top-5 error rate.
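
The layer-wise idea in the abstract (swap one layer for a cheaper module and train that module so its output stays consistent with the original layer) can be illustrated with a toy NumPy sketch. Everything specific here is an assumption made for illustration: the teacher layer is a plain dense matrix standing in for a convolution, the cheaper module is a two-factor bottleneck rather than the biologically inspired modules the paper refers to (which this record does not describe), and the function name `fit_student_layer` and all sizes are hypothetical.

```python
import numpy as np

def fit_student_layer(teacher_w, inputs, hidden, steps=500, lr=0.5, seed=0):
    """Fit a cheap two-factor module (a @ b) to mimic one teacher layer.

    Illustrative stand-in only: the teacher is a dense linear layer and the
    student is a bottleneck of width `hidden`; neither is taken from the paper.
    """
    rng = np.random.default_rng(seed)
    d_in, d_out = teacher_w.shape
    a = rng.normal(scale=0.1, size=(d_in, hidden))
    b = rng.normal(scale=0.1, size=(hidden, d_out))
    target = inputs @ teacher_w              # teacher outputs, held fixed
    losses = []
    for _ in range(steps):
        h = inputs @ a                       # student hidden activations
        err = h @ b - target                 # mismatch with the teacher output
        losses.append(float(np.mean(err ** 2)))
        g = 2.0 * err / err.size             # d(mean squared error)/d(output)
        grad_b = h.T @ g
        grad_a = inputs.T @ (g @ b.T)
        a -= lr * grad_a                     # plain gradient descent updates
        b -= lr * grad_b
    return a, b, losses

# Toy data: a 16 -> 16 teacher layer and 256 random input vectors.
rng = np.random.default_rng(1)
teacher_w = rng.normal(size=(16, 16)) / 4.0
inputs = rng.normal(size=(256, 16))
a, b, losses = fit_student_layer(teacher_w, inputs, hidden=4)

# Multiplications per input vector: dense layer vs. bottleneck student.
teacher_cost = teacher_w.size
student_cost = a.size + b.size
print(f"output MSE: first step {losses[0]:.4f}, last step {losses[-1]:.4f}")
print(f"multiplications per input: teacher {teacher_cost}, student {student_cost}")
```

In a full network, the same fit would presumably be repeated layer by layer, each time feeding the module the inputs that the corresponding original layer receives; that loop is omitted here, and the accuracy and speedup figures quoted in the abstract are not reproduced by this sketch.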

Pages: 631-641

Number of pages: 11

50 records in total

[1] Layer-Wise Compressive Training for Convolutional Neural Networks
Grimaldi, Matteo
Tenace, Valerio
Calimera, Andrea
[J]. FUTURE INTERNET, 2019, 11 (01)
[2] L2-GCN Layer-Wise and Learned Efficient Training of Graph Convolutional Networks
You, Yuning
Chen, Tianlong
Wang, Zhangyang
Shen, Yang
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2124 - 2132
[3] Interpreting Convolutional Neural Networks via Layer-Wise Relevance Propagation
Jia, Wohuan
Zhang, Shaoshuai
Jiang, Yue
Xu, Li
[J]. ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 457 - 467
[4] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks
Huu-Thiet Nguyen
Li, Sitan
Cheah, Chien Chern
[J]. IEEE ACCESS, 2022, 10 : 14270 - 14287
[5] Deep Convolutional Neural Networks with Layer-wise Context Expansion and Attention
Yu, Dong
Xiong, Wayne
Droppo, Jasha
Stolcke, Andreas
Ye, Guoli
Li, Jinyu
Zweig, Geoffrey
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 17 - 21
[6] Supervised Greedy Layer-Wise Training for Deep Convolutional Networks with Small Datasets
Rueda-Plata, Diego
Ramos-Pollan, Raul
Gonzalez, Fabio A.
[J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 : 275 - 284
[7] Implementation of Lightweight Convolutional Neural Networks via Layer-Wise Differentiable Compression
Diao, Huabin
Hao, Yuexing
Xu, Shaoyun
Li, Gongyan
[J]. SENSORS, 2021, 21 (10)
[8] Activation Distribution-based Layer-wise Quantization for Convolutional Neural Networks
Ki, Subin
Kim, Hyun
[J]. 2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[9] SPSA for Layer-Wise Training of Deep Networks
Wulff, Benjamin
Schuecker, Jannis
Bauckhage, Christian
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 564 - 573
[10] Layer-Wise Training Convolutional Neural Networks With Smaller Filters for Human Activity Recognition Using Wearable Sensors
Tang, Yin
Teng, Qi
Zhang, Lei
Min, Fuhong
He, Jun
[J]. IEEE SENSORS JOURNAL, 2021, 21 (01) : 581 - 592
