LPSD: Low-Rank Plus Sparse Decomposition for Highly Compressed CNN Models

被引:1
|
作者
Huang, Kuei-Hsiang [1 ]
Sie, Cheng-Yu [1 ]
Lin, Jhong-En [1 ]
Lee, Che-Rung [1 ]
机构
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
关键词
D O I
10.1007/978-981-97-2242-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-rank decomposition that explores and eliminates the linear dependency within a tensor is often used as a structured model pruning method for deep convolutional neural networks. However, the model accuracy declines rapidly as the compression ratio increases over a threshold. We have observed that with a small amount of sparse elements, the model accuracy can be recovered significantly for the highly compressed CNN models. Based on this premise, we developed a novel method, called LPSD (Low-rank Plus Sparse Decomposition), that decomposes a CNN weight tensor into a combination of a low-rank and a sparse components, which can better maintain the accuracy for the high compression ratio. For a pretrained model, the network structure of each layer is split into two branches: one for low-rank part and one for sparse part. LPSD adapts the alternating approximation algorithm to minimize the global error and the local error alternatively. An exhausted search method with pruning is designed to search the optimal group number, ranks, and sparsity. Experimental results demonstrate that in most scenarios, LPSD achieves better accuracy compared to the state-of-the-art methods when the model is highly compressed.
引用
收藏
页码:353 / 364
页数:12
相关论文
共 50 条
  • [1] Compressed sensing of low-rank plus sparse matrices
    Tanner, Jared
    Vary, Simon
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2023, 64 : 254 - 293
  • [2] EXACT RECOVERY OF LOW-RANK PLUS COMPRESSED SPARSE MATRICES
    Mardani, Morteza
    Mateos, Gonzalo
    Giannakis, Georgios B.
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 49 - 52
  • [3] Nonconvex Splitting for Regularized Low-Rank plus Sparse Decomposition
    Chartrand, Rick
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (11) : 5810 - 5819
  • [4] Sparse and Low-Rank Tensor Decomposition
    Shah, Parikshit
    Rao, Nikhil
    Tang, Gongguo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [5] Robust Low-Rank and Sparse Tensor Decomposition for Low-Rank Tensor Completion
    Shi, Yuqing
    Du, Shiqiang
    Wang, Weilan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 7138 - 7143
  • [6] Pairwise sparse plus low-rank models for variables of mixed type
    Nussbaum, Frank
    Giesen, Joachim
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 178
  • [7] Recovery of Low-Rank Plus Compressed Sparse Matrices With Application to Unveiling Traffic Anomalies
    Mardani, Morteza
    Mateos, Gonzalo
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (08) : 5186 - 5205
  • [8] Pansharpening Based on Low-Rank and Sparse Decomposition
    Rong, Kaixuan
    Jiao, Licheng
    Wang, Shuang
    Liu, Fang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (12) : 4793 - 4805
  • [9] Robust Identification of "Sparse Plus Low-rank" Graphical Models: An Optimization Approach
    Ciccone, Valentina
    Ferrante, Augusto
    Zorzi, Mattia
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2241 - 2246
  • [10] Direction-of-Arrival Tracking via Low-Rank Plus Sparse Matrix Decomposition
    Lin, Bo
    Liu, Jiying
    Xie, Meihua
    Zhu, Jubo
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2015, 14 : 1302 - 1305