Operation-Aware Soft Channel Pruning using Differentiable Masks

被引：0

作者：

Kang, Minsoo ^{[1
,2
]}

Han, Bohyung ^{[1
,2
]}

机构：

[1] Seoul Natl Univ, Dept Elect & Comp Engn, Comp Vis Lab, Seoul, South Korea

[2] Seoul Natl Univ, ASRI, Seoul, South Korea

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119 | 2020年 / 119卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a simple but effective data-driven channel pruning algorithm, which compresses deep neural networks in a differentiable way by exploiting the characteristics of operations. The proposed approach makes a joint consideration of batch normalization (BN) and rectified linear unit (ReLU) for channel pruning; it estimates how likely the two successive operations deactivate each feature map and prunes the channels with high probabilities. To this end, we learn differentiable masks for individual channels and make soft decisions throughout the optimization procedure, which facilitates to explore larger search space and train more stable networks. The proposed framework enables us to identify compressed models via a joint learning of model parameters and channel pruning without an extra procedure of fine-tuning. We perform extensive experiments and achieve outstanding performance in terms of the accuracy of output networks given the same amount of resources when compared with the state-of-the-art methods.

引用

页数：10

共 50 条

[1] Differentiable Network Pruning via Polarization of Probabilistic Channelwise Soft Masks
Ma, Ming
Wang, Jiapeng
Yu, Zhenhua
[J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[2] Operation-Aware Power Capping
Wang, Bo
Miller, Julian
Terboven, Christian
Mueller, Matthias
[J]. EURO-PAR 2020: PARALLEL PROCESSING, 2020, 12247 : 68 - 82
[3] Operation-aware Neural Networks for user response prediction
Yang, Yi
Xu, Baile
Shen, Shaofeng
Shen, Furao
Zhao, Jian
[J]. NEURAL NETWORKS, 2020, 121 : 161 - 168
[4] Model Compression Based on Differentiable Network Channel Pruning
Zheng, Yu-Jie
Chen, Si-Bao
Ding, Chris H. Q.
Luo, Bin
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10203 - 10212
[5] DMCP: Differentiable Markov Channel Pruning for Neural Networks
Guo, Shaopeng
Wang, Yujie
Li, Quanquan
Yan, Junjie
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1536 - 1544
[6] Operation-Aware Assist Circuit Design for Improved Write Performance of FinFET based SRAM
Prajapati, Ekta
Yadav, Nandakishor
Pattanaik, Manisha
Sharma, G. K.
[J]. 18TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST, 2014,
[7] Degradation and Operation-Aware Framework for the Optimal Siting, Sizing, and Technology Selection of Battery Storage
Sayfutdinov, Timur
Patsios, Charalampos
Vorobev, Petr
Gryazina, Elena
Greenwood, David M.
Bialek, Janusz W.
Taylor, Philip C.
[J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (04) : 2130 - 2140
[8] Operation and Topology Aware Fast Differentiable Architecture Search
Siddiqui, Shahid
Kyrkou, Christos
Theocharides, Theocharis
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9666 - 9673
[9] Localization -aware channel pruning for object detection
Xie, Zihao
Zhu, Li
Zhao, Lin
Tao, Bo
Liu, Liman
Tao, Wenbing
[J]. NEUROCOMPUTING, 2020, 403 : 400 - 408
[10] Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach
Hanjing Cheng
Zidong Wang
Lifeng Ma
Zhihui Wei
Fawaz E. Alsaadi
Xiaohui Liu
[J]. Complex & Intelligent Systems, 2023, 9 : 5611 - 5624

← 1 2 3 4 5 →