Redundancy-Aware Pruning of Convolutional Neural Networks

被引:3
|
作者
Xie, Guotian [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Guangdong Key Lab Informat Secur Technol, Guangzhou 510006, Guangdong, Peoples R China
关键词
Convolutional neural networks - Digital arithmetic - Convolution - Redundancy;
D O I
10.1162/neco_a_01330
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pruning is an effective way to slim and speed up convolutional neural networks. Generally previous work directly pruned neural networks in the original feature space without considering the correlation of neurons. We argue that such a way of pruning still keeps some redundancy in the pruned networks. In this letter, we proposed to prune in the intermediate space in which the correlation of neurons is eliminated. To achieve this goal, the input and output of a convolutional layer are first mapped to an intermediate space by orthogonal transformation. Then neurons are evaluated and pruned in the intermediate space. Extensive experiments have shown that our redundancy-aware pruning method surpasses state-of-the-art pruning methods on both efficiency and accuracy. Notably, using our redundancy-aware pruning method, ResNet models with three times the speed-up could achieve competitive performance with fewer floating point operations per second even compared to DenseNet.
引用
收藏
页码:2482 / 2506
页数:25
相关论文
共 50 条
  • [11] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
    Heidorn, Christian
    Sabih, Muhammad
    Meyerhoefer, Nicolai
    Schinabeck, Christian
    Teich, Juergen
    Hannig, Frank
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 40 - 58
  • [12] Redundancy-Aware Action Spaces for Robot Learning
    Mazzaglia, Pietro
    Backshall, Nicholas
    Ma, Xiao
    James, Stephen
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 6912 - 6919
  • [13] Redundancy-aware Transformer for Video Question Answering
    Li, Yicong
    Yang, Xun
    Zhang, An
    Feng, Chun
    Wang, Xiang
    Chua, Tat-Seng
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3172 - 3180
  • [14] HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration
    Yu, Fang
    Han, Chuanqi
    Wang, Pengcheng
    Huang, Ruoran
    Huang, Xi
    Cui, Li
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 255 - 262
  • [15] Inference-aware convolutional neural network pruning
    Choudhary, Tejalal
    Mishra, Vipul
    Goswami, Anurag
    Sarangapani, Jagannathan
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 44 - 56
  • [16] ReACT: Redundancy-Aware Code Generation for Tensor Expressions
    Zhou, Tong
    Tian, Ruiqin
    Ashraf, Rizwan A.
    Gioiosa, Roberto
    Kestor, Gokcen
    Sarkar, Vivek
    [J]. PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 1 - 13
  • [17] Redundancy-Aware Electromigration Checking for Mesh Power Grids
    Chatterjee, Sandeep
    Fawaz, Mohammad
    Najm, Farid N.
    [J]. 2013 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2013, : 540 - 547
  • [18] Towards a Redundancy-Aware Network Stack for Data Centers
    Iftikhar, Ali Musa
    Dogar, Fahad R.
    Qazi, Ihsan Ayyub
    [J]. PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 57 - 63
  • [19] Redundancy-Aware Topic Modeling for Patient Record Notes
    Cohen, Raphael
    Aviram, Iddo
    Elhadad, Michael
    Elhadad, Noemie
    [J]. PLOS ONE, 2014, 9 (02):
  • [20] Energy-aware redundancy-aware clustering in wireless sensor networks using Spined Loach Searching Optimization
    Sasikala, N.
    Sangaiah, Pavalarajan
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2023, 36 (03)