Tutor-Instructing Global Pruning for Accelerating Convolutional Neural Networks

Cited: 2
Authors
Yu, Fang [1,2]
Cui, Li [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DOI
10.3233/FAIA200420
CLC Classification
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Model compression and acceleration have recently received ever-increasing research attention. Among these techniques, filter pruning is particularly promising because it yields significant inference speedups and is readily supported on off-the-shelf computing platforms. Most existing works prune filters in a layer-wise manner, where networks are pruned and fine-tuned layer by layer. However, these methods require intensive computation for per-layer sensitivity analysis and suffer from the accumulation of pruning errors. To address these challenges, we propose a novel pruning method, namely Tutor-Instructing global Pruning (TIP), to prune redundant filters in a global manner. TIP introduces Information Gain (IG) to estimate the contribution of filters to the class probability distribution of the network output. The motivation of TIP is to formulate filter pruning as a minimization of the IG with respect to a group of pruned filters under a constraint on the size of the pruned network. To solve this problem, we propose a Taylor-based approximation algorithm, which efficiently obtains the IG of each filter by backpropagation. We comprehensively evaluate TIP on CIFAR-10 and ILSVRC-12. On ILSVRC-12, TIP reduces the FLOPs of ResNet-50 by 54.13% with only a 0.1% drop in top-5 accuracy, significantly outperforming state-of-the-art methods.
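As a rough illustration of the backpropagation-based Taylor approximation described in the abstract, the sketch below scores the filters of every Conv2d layer by a first-order Taylor term, |activation x gradient|, accumulated over a data loader. The cross-entropy surrogate loss, the function name taylor_filter_scores, and the per-channel aggregation are illustrative assumptions for a PyTorch setting, not the paper's exact Information-Gain objective.

# Minimal sketch: first-order Taylor estimate of per-filter importance via backprop.
# The surrogate loss (cross-entropy) and all names here are illustrative assumptions,
# not the exact Information-Gain formulation of TIP.
import torch
import torch.nn as nn
import torch.nn.functional as F

def taylor_filter_scores(model, data_loader, device="cpu"):
    """Accumulate |activation * gradient| per output channel of each Conv2d layer."""
    scores, activations, hooks = {}, {}, []

    def save_activation(name):
        def hook(module, inputs, output):
            output.retain_grad()          # keep the gradient of this feature map
            activations[name] = output
        return hook

    for name, module in model.named_modules():
        if isinstance(module, nn.Conv2d):
            hooks.append(module.register_forward_hook(save_activation(name)))
            scores[name] = torch.zeros(module.out_channels, device=device)

    model.to(device).eval()
    for images, labels in data_loader:
        images, labels = images.to(device), labels.to(device)
        model.zero_grad()
        loss = F.cross_entropy(model(images), labels)
        loss.backward()
        for name, act in activations.items():
            # First-order Taylor term: |a * dL/da|, summed over batch and spatial dims.
            contrib = (act * act.grad).abs().sum(dim=(0, 2, 3))
            scores[name] += contrib.detach()

    for h in hooks:
        h.remove()
    return scores  # lower score -> candidate for pruning

Globally ranking the concatenated scores from all layers and removing the lowest-scoring filters corresponds to pruning in a global rather than layer-wise manner; the ranking criterion itself would have to be replaced by the IG-based contribution to match the method described above.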
Pages: 2792-2799
Page count: 8