Block-Based Compression and Corresponding Hardware Circuits for Sparse Activations

被引:1
|
作者
Weng, Yui-Kai [1 ]
Huang, Shih-Hsu [1 ]
Kao, Hsu-Yu [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 32023, Taiwan
关键词
compression formats; convolutional neural networks; data volume; digital circuits; edge computing; logic design; DEEP NEURAL-NETWORKS; HIGH-SPEED; CNN; ACCELERATOR; UNIT;
D O I
10.3390/s21227468
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In a CNN (convolutional neural network) accelerator, to reduce memory traffic and power consumption, there is a need to exploit the sparsity of activation values. Therefore, some research efforts have been paid to skip ineffectual computations (i.e., multiplications by zero). Different from previous works, in this paper, we point out the similarity of activation values: (1) in the same layer of a CNN model, most feature maps are either highly dense or highly sparse; (2) in the same layer of a CNN model, feature maps in different channels are often similar. Based on the two observations, we propose a block-based compression approach, which utilizes both the sparsity and the similarity of activation values to further reduce the data volume. Moreover, we also design an encoder, a decoder and an indexing module to support the proposed approach. The encoder is used to translate output activations into the proposed block-based compression format, while both the decoder and the indexing module are used to align nonzero values for effectual computations. Compared with previous works, benchmark data consistently show that the proposed approach can greatly reduce both memory traffic and power consumption.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] HRBP: Hardware-friendly Regrouping towards Block-based Pruning for Sparse CNN Training
    Ma, Haoyu
    Zhang, Chengming
    Xiang, Lizhi
    Ma, Xiaolong
    Yuan, Geng
    Zhang, Wenkai
    Liu, Shiwei
    Chen, Tianlong
    Tao, Dingwen
    Wang, Yanzhi
    Wang, Zhangyang
    Xie, Xiaohui
    [J]. CONFERENCE ON PARSIMONY AND LEARNING, VOL 234, 2024, 234 : 282 - 301
  • [2] Distributed bundle adjustment with block-based sparse matrix compression for super large scale datasets
    Zheng, Maoteng
    Chen, Nengcheng
    Zhu, Junfeng
    Zeng, Xiaoru
    Qiu, Huanbin
    Jiang, Yuyao
    Lu, Xingyue
    Qu, Hao
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18106 - 18116
  • [3] Learned Block-Based Hybrid Image Compression
    Wu, Yaojun
    Li, Xin
    Zhang, Zhizheng
    Jin, Xin
    Chen, Zhibo
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3978 - 3990
  • [4] Block-based fast compression for compound images
    Ding, Wenpeng
    Liu, Dong
    He, Yuwen
    Wu, Feng
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 809 - 812
  • [5] Block-based KLT compression for multispectral images
    Nian, Yongjian
    Xu, Ke
    Wan, Jianwei
    Wang, Ling
    He, Mi
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2016, 14 (04)
  • [6] An Adaptive Block-Based Histogram Packing for Improving the Compression Performance of JPEG-LS for Images with Sparse and Locally Sparse Histograms
    Jallouli, Souha
    Zouari, Sonia
    Masmoudi, Nouri
    Masmoudi, Atef
    [J]. IMAGE AND SIGNAL PROCESSING (ICISP 2018), 2018, 10884 : 63 - 71
  • [7] Skip block-based wavelet compression for hyperspectral images
    Banu, N. M. Masoodhu
    Sasikumar, S.
    [J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2016, 9 (03) : 156 - 164
  • [8] A simple block-based lossless image compression scheme
    Chang, SG
    Yovanof, GS
    [J]. THIRTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1997, : 591 - 595
  • [9] A hardware implementation of a dynamically adjustable Block-Based Neural Network
    Liu, SY
    Peterson, GD
    Kong, SG
    [J]. ERSA'05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2005, : 203 - 208
  • [10] Fixed block-based lossless compression of digital mammograms
    Al-Saiegh, MY
    Krishnan, S
    [J]. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 937 - 942