LPSD: Low-Rank Plus Sparse Decomposition for Highly Compressed CNN Models

被引:1
|
作者
Huang, Kuei-Hsiang [1 ]
Sie, Cheng-Yu [1 ]
Lin, Jhong-En [1 ]
Lee, Che-Rung [1 ]
机构
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
关键词
D O I
10.1007/978-981-97-2242-6_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-rank decomposition that explores and eliminates the linear dependency within a tensor is often used as a structured model pruning method for deep convolutional neural networks. However, the model accuracy declines rapidly as the compression ratio increases over a threshold. We have observed that with a small amount of sparse elements, the model accuracy can be recovered significantly for the highly compressed CNN models. Based on this premise, we developed a novel method, called LPSD (Low-rank Plus Sparse Decomposition), that decomposes a CNN weight tensor into a combination of a low-rank and a sparse components, which can better maintain the accuracy for the high compression ratio. For a pretrained model, the network structure of each layer is split into two branches: one for low-rank part and one for sparse part. LPSD adapts the alternating approximation algorithm to minimize the global error and the local error alternatively. An exhausted search method with pruning is designed to search the optimal group number, ranks, and sparsity. Experimental results demonstrate that in most scenarios, LPSD achieves better accuracy compared to the state-of-the-art methods when the model is highly compressed.
引用
收藏
页码:353 / 364
页数:12
相关论文
共 50 条
  • [21] SOUND FIELD DECOMPOSITION IN REVERBERANT ENVIRONMENT USING SPARSE AND LOW-RANK SIGNAL MODELS
    Koyama, Shoichi
    Saruwatari, Hiroshi
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 395 - 399
  • [22] NON-LINE-OF-SIGHT LOCALIZATION USING LOW-RANK plus SPARSE MATRIX DECOMPOSITION
    Ekambaram, Venkatesan N.
    Ramchandran, Kannan
    2012 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2012, : 317 - 320
  • [23] Speech Denoising in White Noise Based on Signal Subspace Low-rank Plus Sparse Decomposition
    Yuan, Shuai
    Sun, Cheng-li
    2017 INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING (EITCE 2017), 2017, 128
  • [24] SCOPE: signal compensation for low-rank plus sparse matrix decomposition for fast parameter mapping
    Zhu, Yanjie
    Liu, Yuanyuan
    Ying, Leslie
    Peng, Xi
    Wang, Yi-Xiang J.
    Yuan, Jing
    Liu, Xin
    Liang, Dong
    PHYSICS IN MEDICINE AND BIOLOGY, 2018, 63 (18):
  • [25] Low-rank plus sparse decomposition based dynamic myocardial perfusion PET image restoration
    Lu, Lijun
    Ma, Xiaomian
    Ma, JIanhua
    Feng, Qianjin
    Rahmim, Arman
    Chen, Wufan
    JOURNAL OF NUCLEAR MEDICINE, 2015, 56 (03)
  • [26] UNDER-SAMPLED FUNCTIONAL MRI USING LOW-RANK PLUS SPARSE MATRIX DECOMPOSITION
    Singh, Vimal
    Tewfik, Ahmed H.
    Ress, David B.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 897 - 901
  • [27] Enhancement of dynamic myocardial perfusion PET images based on low-rank plus sparse decomposition
    Lu, Lijun
    Ma, Xiaomian
    Mohy-ud-Din, Hassan
    Ma, Jianhua
    Feng, Qianjin
    Rahmim, Arman
    Chen, Wufan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 154 : 57 - 69
  • [28] LOW-RANK ON GRAPHS PLUS TEMPORALLY SMOOTH SPARSE DECOMPOSITION FOR ANOMALY DETECTION IN SPATIOTEMPORAL DATA
    Sofuoglu, Seyyid Emre
    Aviyente, Selin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5614 - 5618
  • [29] Low-Rank and Adaptive Sparse Signal (LASSI) Models for Highly Accelerated Dynamic Imaging
    Ravishankar, Saiprasad
    Moore, Brian E.
    Nadakuditi, Raj Rao
    Fessler, Jeffrey A.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (05) : 1116 - 1128
  • [30] Beyond Low Rank plus Sparse: Multiscale Low Rank Matrix Decomposition
    Ong, Frank
    Lustig, Michael
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2016, 10 (04) : 672 - 687