Deep neural networks compression: A comparative survey and choice recommendations

Cited by: 25
Authors
Marino, Giosue Cataldo [1]
Petrini, Alessandro [1]
Malchiodi, Dario [1]
Frasca, Marco [1]
Affiliations
[1] Univ Milan, Dipartimento Informat, Via Celoria 18, I-20133 Milan, Italy
Keywords
CNN compression; Connection pruning; Weight quantization; Weight sharing; Huffman coding; Succinct Deep Neural Networks; WEIGHTS
DOI
10.1016/j.neucom.2022.11.072
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The state-of-the-art performance for several real-world problems is currently reached by deep and, in particular, convolutional neural networks (CNN). Such learning models exploit recent results in the field of deep learning, leading to highly performing, yet very large neural networks with typically millions to billions of parameters. As a result, such models are often redundant and excessively oversized, with a detrimental effect on the environment in terms of unnecessary energy consumption and a limitation to their deployment on low-resource devices. The necessity for compression techniques able to reduce the number of model parameters and their resource demand is thereby increasingly felt by the research community. In this paper we propose the first extensive comparison, to the best of our knowledge, of the main lossy and structure-preserving approaches to compress pre-trained CNNs, applicable in principle to any existing model. Our study is intended to provide a first and preliminary guidance to choose the most suitable compression technique when there is the need to reduce the occupancy of pre-trained models. Both convolutional and fully-connected layers are included in the analysis. Our experiments involved two pre-trained state-of-the-art CNNs (proposed to solve classification or regression problems) and five benchmarks, and gave rise to important insights about the applicability and performance of such techniques w.r.t. the type of layer to be compressed and the category of problem tackled.
© 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Pages: 152-170
Number of pages: 19