An Accuracy-Driven Compression Methodology to Derive Efficient Codebook-Based CNNs

被引:0
|
作者
Ponzina, Flavio [1 ]
Peon-Quiros, Miguel [1 ]
Ansaloni, Giovanni [1 ]
Atienza, David [1 ]
机构
[1] Swiss Fed Inst Technol EPFL, Embedded Syst Lab ESL, Route Cantonale, CH-1015 Lausanne, Switzerland
基金
欧盟地平线“2020”;
关键词
CNN compression; Clustering; Ensembling;
D O I
10.1109/COINS54846.2022.9854986
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Codebook-based optimizations are a class of algorithmic-level transformations able to effectively reduce the computing and memory requirements of Convolutional Neural Networks (CNNs). This approach tightly limits the number of unique weights in each layer, allowing the storage of employed values in codebooks containing a small number of floating-point entries. Then, CNN models are represented as low-bitwidth indexes of such codebooks. This work introduces a novel iterative methodology to find highly beneficial schemes trading off accuracy and model compression in codebook-based CNNs. Our strategy can retrieve non-uniform solutions driven by an accuracy constraint embedded in the optimization loop. Our results indicate that, for a 1 % accuracy degradation, our methodology can compress baseline floating-point CNN models up to 19x. Moreover, by reducing the number of memory accesses, our strategy increases energy efficiency and improves inference performance by up to 91 %.
引用
收藏
页码:24 / 29
页数:6
相关论文
共 16 条
  • [1] An Area-Efficient FPGA Realisation of a Codebook-Based Image Compression Method
    Zipf, Peter
    Hinkelmann, Heiko
    Shao, Hui
    Dogaru, Radu
    Glesner, Manfred
    [J]. PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY, 2008, : 349 - +
  • [2] A Fast and Efficient Codebook-Based RIS Phase Configuration Method
    Haskou, Abdullah
    Khaleghi, Hamidreza
    [J]. 2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 13 - 17
  • [3] Efficient Codebook-Based MIMO Beamforming for Millimeter-Wave WLANs
    Zhou, Liang
    Ohashi, Yoji
    [J]. 2012 IEEE 23RD INTERNATIONAL SYMPOSIUM ON PERSONAL INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2012, : 1885 - 1889
  • [4] Dynamically restricted codebook-based vector quantisation scheme for mesh geometry compression
    Zhe-Ming Lu
    Zhen Li
    [J]. Signal, Image and Video Processing, 2008, 2 : 251 - 260
  • [5] Dynamically restricted codebook-based vector quantisation scheme for mesh geometry compression
    Lu, Zhe-Ming
    Li, Zhen
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2008, 2 (03) : 251 - 260
  • [6] Intra-picture Block-matching Method for Codebook-based Texture Compression
    Cui, Li
    Jang, Euee S.
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (10): : 5063 - 5073
  • [7] An Efficient Codebook-based Beam Training Technique for Millimeter-Wave Communication Systems
    Okoth, Phonfred J.
    Nguyen, Quang N.
    Dhakal, Dhruba R.
    Nozaki, Daichi
    Yamada, Yoshihide
    Sato, Takuro
    [J]. 2018 ASIA-PACIFIC MICROWAVE CONFERENCE PROCEEDINGS (APMC), 2018, : 666 - 668
  • [8] Efficient Codebook-Based Beamforming Algorithm for Millimeter-Wave Massive MIMO Systems
    Chen, Jung-Chieh
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (09) : 7809 - 7817
  • [9] High-rate compression of ECG signals by an accuracy-driven sparsity model relying on natural basis
    Grossi, Giuliano
    Lanzarotti, Raffaella
    Lin, Jianyi
    [J]. DIGITAL SIGNAL PROCESSING, 2015, 45 : 96 - 106
  • [10] Codebook-based pseudo-impostor data generation and template compression for text-dependent speaker verification
    Luan, Jian
    Hao, Jie
    Kakino, Tomonari
    Kawamura, Akinori
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (09) : 1414 - 1421