Operation-Aware Soft Channel Pruning using Differentiable Masks

Citations: 0
Authors
Kang, Minsoo [1, 2]
Han, Bohyung [1, 2]
Affiliations
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Comp Vis Lab, Seoul, South Korea
[2] Seoul Natl Univ, ASRI, Seoul, South Korea
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a simple but effective data-driven channel pruning algorithm, which compresses deep neural networks in a differentiable way by exploiting the characteristics of operations. The proposed approach jointly considers batch normalization (BN) and the rectified linear unit (ReLU) for channel pruning; it estimates how likely the two successive operations are to deactivate each feature map and prunes the channels with high deactivation probabilities. To this end, we learn differentiable masks for individual channels and make soft decisions throughout the optimization procedure, which makes it possible to explore a larger search space and train more stable networks. The proposed framework enables us to identify compressed models via joint learning of model parameters and channel pruning, without an extra fine-tuning procedure. We perform extensive experiments and, given the same amount of resources, achieve higher output-network accuracy than state-of-the-art methods.
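The idea of estimating how likely BN followed by ReLU is to deactivate a channel can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes that the normalized pre-activation of each channel is approximately standard normal, so that the BN output gamma * z + beta is non-positive with probability Phi(-beta / |gamma|). The function names, the eps constant, and the 0.95 threshold are hypothetical choices made for illustration.

```python
import math


def deactivation_probability(gamma: float, beta: float, eps: float = 1e-8) -> float:
    """P(gamma * z + beta <= 0) for z ~ N(0, 1) (assumed distribution).

    This is the chance that ReLU zeroes the BN output of one channel.
    """
    scale = abs(gamma) + eps  # eps avoids division by zero when gamma == 0
    # Standard normal CDF evaluated at -beta / |gamma|.
    return 0.5 * (1.0 + math.erf((-beta / scale) / math.sqrt(2.0)))


def soft_masks(gammas, betas):
    """Soft keep-mask per channel: one minus the deactivation probability."""
    return [1.0 - deactivation_probability(g, b) for g, b in zip(gammas, betas)]


def channels_to_prune(gammas, betas, threshold: float = 0.95):
    """Indices of channels whose deactivation probability reaches the threshold."""
    return [
        i
        for i, (g, b) in enumerate(zip(gammas, betas))
        if deactivation_probability(g, b) >= threshold
    ]


if __name__ == "__main__":
    gammas = [1.0, 0.5, 1.0]   # hypothetical BN scale parameters
    betas = [0.0, -2.0, 3.0]   # hypothetical BN shift parameters
    print(soft_masks(gammas, betas))
    print(channels_to_prune(gammas, betas))
```

Because the probability is a smooth function of gamma and beta, the same expression written in an automatic-differentiation framework could act as a soft mask during training, with a hard decision taken only at the end. That is one plausible reading of the abstract, not a confirmed detail of the method.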
Pages: 10
Related papers
50 in total
  • [1] Differentiable Network Pruning via Polarization of Probabilistic Channelwise Soft Masks
    Ma, Ming
    Wang, Jiapeng
    Yu, Zhenhua
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [2] Operation-Aware Power Capping
    Wang, Bo
    Miller, Julian
    Terboven, Christian
    Mueller, Matthias
    [J]. EURO-PAR 2020: PARALLEL PROCESSING, 2020, 12247 : 68 - 82
  • [3] Operation-aware Neural Networks for user response prediction
    Yang, Yi
    Xu, Baile
    Shen, Shaofeng
    Shen, Furao
    Zhao, Jian
    [J]. NEURAL NETWORKS, 2020, 121 : 161 - 168
  • [4] Model Compression Based on Differentiable Network Channel Pruning
    Zheng, Yu-Jie
    Chen, Si-Bao
    Ding, Chris H. Q.
    Luo, Bin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10203 - 10212
  • [5] DMCP: Differentiable Markov Channel Pruning for Neural Networks
    Guo, Shaopeng
    Wang, Yujie
    Li, Quanquan
    Yan, Junjie
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1536 - 1544
  • [6] Operation-Aware Assist Circuit Design for Improved Write Performance of FinFET based SRAM
    Prajapati, Ekta
    Yadav, Nandakishor
    Pattanaik, Manisha
    Sharma, G. K.
    [J]. 18TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST, 2014,
  • [7] Degradation and Operation-Aware Framework for the Optimal Siting, Sizing, and Technology Selection of Battery Storage
    Sayfutdinov, Timur
    Patsios, Charalampos
    Vorobev, Petr
    Gryazina, Elena
    Greenwood, David M.
    Bialek, Janusz W.
    Taylor, Philip C.
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (04) : 2130 - 2140
  • [8] Operation and Topology Aware Fast Differentiable Architecture Search
    Siddiqui, Shahid
    Kyrkou, Christos
    Theocharides, Theocharis
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9666 - 9673
  • [9] Localization-aware channel pruning for object detection
    Xie, Zihao
    Zhu, Li
    Zhao, Lin
    Tao, Bo
    Liu, Liman
    Tao, Wenbing
    [J]. NEUROCOMPUTING, 2020, 403 : 400 - 408
  • [10] Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach
    Hanjing Cheng
    Zidong Wang
    Lifeng Ma
    Zhihui Wei
    Fawaz E. Alsaadi
    Xiaohui Liu
    [J]. Complex & Intelligent Systems, 2023, 9 : 5611 - 5624