Block-Based Compression and Corresponding Hardware Circuits for Sparse Activations

被引:1
|
作者
Weng, Yui-Kai [1 ]
Huang, Shih-Hsu [1 ]
Kao, Hsu-Yu [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 32023, Taiwan
关键词
compression formats; convolutional neural networks; data volume; digital circuits; edge computing; logic design; DEEP NEURAL-NETWORKS; HIGH-SPEED; CNN; ACCELERATOR; UNIT;
D O I
10.3390/s21227468
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In a CNN (convolutional neural network) accelerator, to reduce memory traffic and power consumption, there is a need to exploit the sparsity of activation values. Therefore, some research efforts have been paid to skip ineffectual computations (i.e., multiplications by zero). Different from previous works, in this paper, we point out the similarity of activation values: (1) in the same layer of a CNN model, most feature maps are either highly dense or highly sparse; (2) in the same layer of a CNN model, feature maps in different channels are often similar. Based on the two observations, we propose a block-based compression approach, which utilizes both the sparsity and the similarity of activation values to further reduce the data volume. Moreover, we also design an encoder, a decoder and an indexing module to support the proposed approach. The encoder is used to translate output activations into the proposed block-based compression format, while both the decoder and the indexing module are used to align nonzero values for effectual computations. Compared with previous works, benchmark data consistently show that the proposed approach can greatly reduce both memory traffic and power consumption.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Medical image compression using block-based transform coding techniques
    DeNeve, P
    Philips, W
    VanOverloop, J
    Lemahieu, I
    [J]. DIGITAL COMPRESSION TECHNOLOGIES AND SYSTEMS FOR VIDEO COMMUNICATIONS, 1996, 2952 : 216 - 223
  • [42] Towards block-based compression of genomic data with random access functionality
    Paridaens, Tom
    Van Stappen, Yves
    De Neve, Wesley
    Lambert, Peter
    Van de Walle, Rik
    [J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 1360 - 1363
  • [43] Reconstruction for block-based compressive sensing of image with reweighted double sparse constraint
    Yuanhong Zhong
    Jing Zhang
    Xinyu Cheng
    Guan Huang
    Zhaokun Zhou
    Zhiyong Huang
    [J]. EURASIP Journal on Image and Video Processing, 2019
  • [44] Reconstruction for block-based compressive sensing of image with reweighted double sparse constraint
    Zhong, Yuanhong
    Zhang, Jing
    Cheng, Xinyu
    Huang, Guan
    Zhou, Zhaokun
    Huang, Zhiyong
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (1) : 1 - 14
  • [45] A Comparison Between Block-Based and Non Block-Based Watermarking Schemes based on DWT
    Al-Qershi, Osamah M.
    Ee, Khoo Bee
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL I, 2009, : 169 - 173
  • [46] Synthesis Procedure of Configurable Building Block-Based Linear and Nonlinear Analog Circuits
    Bhanja, Mousumi
    Ray, Baidya Nath
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (12) : 1940 - 1953
  • [47] Deep Learning-Based Hardware Trojan Detection With Block-Based Netlist Information Extraction
    Yu, Shichao
    Gu, Chongyan
    Liu, Weiqiang
    O'Neill, Maire
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (04) : 1837 - 1853
  • [48] A Block-Based JPEG-LS Compression Technique with Lossless Region of Interest
    Deng, Lihua
    Huang, Zhenghua
    Yao, Shoukui
    [J]. MIPPR 2017: PARALLEL PROCESSING OF IMAGES AND OPTIMIZATION TECHNIQUES; AND MEDICAL IMAGING, 2018, 10610
  • [49] Constrained ECG compression algorithm using the block-based discrete cosine transform
    Benzid, R.
    Messaoudi, A.
    Boussaad, A.
    [J]. DIGITAL SIGNAL PROCESSING, 2008, 18 (01) : 56 - 64
  • [50] Issues in implementing block-based image compression techniques on parallel MIMD architectures
    Uhl, A
    Hammerle, J
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 494 - 501