Discriminative Layer Pruning for Convolutional Neural Networks

Cited by: 27
Authors
Jordao, Artur [1 ]
Lie, Maiko [1 ]
Schwartz, William Robson [1 ]
Affiliations
[1] Univ Fed Minas Gerais, Dept Comp Sci, Smart Sense Lab, BR-31270901 Belo Horizonte, MG, Brazil
Keywords
Computer architecture; Estimation; Convolutional neural networks; Computational efficiency; Internet of Things; Visualization; Network compression; Network pruning
DOI
10.1109/JSTSP.2020.2975987
Chinese Library Classification (CLC): TM (Electrical Technology); TN (Electronic & Communication Technology)
Discipline codes: 0808; 0809
Abstract
The predictive ability of convolutional neural networks (CNNs) can be improved by increasing their depth. However, increasing depth also increases computational cost significantly, in terms of both floating point operations and memory consumption, hindering applicability on resource-constrained systems such as mobile and Internet of Things (IoT) devices. Fortunately, most networks have spare capacity, that is, they require fewer parameters than they actually have to perform accurately. This motivates network compression methods, which remove or quantize parameters to improve resource-efficiency. In this work, we consider a straightforward strategy for removing entire convolutional layers to reduce network depth. Since it focuses on depth, this approach not only reduces memory usage, but also reduces prediction time significantly by mitigating the serialization overhead incurred by forwarding through consecutive layers. We show that a simple subspace projection approach can be employed to estimate the importance of network layers, enabling the pruning of CNNs to a resource-efficient depth within a given network size constraint. We estimate importance on a subspace computed using Partial Least Squares, a feature projection approach that preserves discriminative information. Consequently, this importance estimation is correlated with the contribution of the layer to the classification ability of the model. We show that cascading discriminative layer pruning with filter-oriented pruning improves the resource-efficiency of the resulting network compared to using either alone, and that it outperforms state-of-the-art methods. Moreover, we show that discriminative layer pruning alone, without cascading, achieves competitive resource-efficiency compared to methods that prune filters from all layers.
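The abstract's core idea is to score each layer by how much discriminative information its features carry, using a Partial Least Squares (PLS) projection against the class labels. The sketch below is a minimal, illustrative numpy implementation of single-response PLS (the NIPALS variant): it returns the fraction of label variance captured by a few PLS components of a layer's (flattened) activations, which can serve as a layer-importance proxy. The function name, the two-component default, and the use of activations-as-features are assumptions for illustration; this is not the authors' code.

```python
import numpy as np

def pls_importance(X, y, n_components=2):
    """Proxy for layer importance: fraction of label variance captured
    by PLS components of the layer's activation matrix X (samples x features).
    Illustrative single-response PLS via NIPALS; not the paper's implementation."""
    Xr = X - X.mean(axis=0)                 # center features
    yr = (y - np.mean(y)).astype(float)     # center labels
    total = yr @ yr                         # total label variance (unnormalized)
    captured = 0.0
    for _ in range(n_components):
        w = Xr.T @ yr                       # weight direction maximizing covariance with y
        w /= np.linalg.norm(w) + 1e-12
        t = Xr @ w                          # component scores (projection of X)
        tt = t @ t + 1e-12
        q = (yr @ t) / tt                   # regression of y on the component
        captured += q * q * tt              # label variance explained by this component
        p = Xr.T @ t / tt                   # X loadings
        Xr = Xr - np.outer(t, p)            # deflate X
        yr = yr - q * t                     # deflate y
    return float(captured / total)
```

A layer whose activations separate the classes well scores near 1, while a layer producing label-independent noise scores near 0; ranking layers by this score and removing the lowest-scoring ones is the rough intuition behind discriminative layer pruning.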
Pages: 828-837 (10 pages)
Related Papers (50 total)
  • [1] Flattening Layer Pruning in Convolutional Neural Networks
    Jeczmionek, Ernest
    Kowalski, Piotr A.
    SYMMETRY-BASEL, 2021, 13(7)
  • [2] Pruning Ratio Optimization with Layer-Wise Pruning Method for Accelerating Convolutional Neural Networks
    Kamma, Koji
    Inoue, Sarimu
    Wada, Toshikazu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D(1): 161-169
  • [3] Layer Pruning via Fusible Residual Convolutional Block for Deep Neural Networks
    Xu, Pengtao
    Cao, Jian
    Sun, Wenyu
    Li, Pu
    Wang, Yuan
    Zhang, Xing
    Acta Scientiarum Naturalium Universitatis Pekinensis (Beijing Daxue Xuebao, Ziran Kexue Ban), 2022, 58(5): 801-807
  • [4] Leveraging Structured Pruning of Convolutional Neural Networks
    Tessier, Hugo
    Gripon, Vincent
    Leonardon, Mathieu
    Arzel, Matthieu
    Bertrand, David
    Hannagan, Thomas
    2022 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2022: 174-179
  • [5] Iterative clustering pruning for convolutional neural networks
    Chang, Jingfei
    Lu, Yang
    Xue, Ping
    Xu, Yiqun
    Wei, Zhen
    KNOWLEDGE-BASED SYSTEMS, 2023, 265
  • [6] Structured Pruning of Deep Convolutional Neural Networks
    Anwar, Sajid
    Hwang, Kyuyeon
    Sung, Wonyong
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13(3)
  • [7] Activation Pruning of Deep Convolutional Neural Networks
    Ardakani, Arash
    Condo, Carlo
    Gross, Warren J.
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017): 1325-1329
  • [8] Blending Pruning Criteria for Convolutional Neural Networks
    He, Wei
    Huang, Zhongzhan
    Liang, Mingfu
    Liang, Senwei
    Yang, Haizhao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894: 3-15
  • [10] Pruning feature maps for efficient convolutional neural networks
    Guo, Xiao-ting
    Xie, Xin-shu
    Lang, Xun
    OPTIK, 2023, 281