Channel-Wise Activation Map Pruning using MaxPool for Reducing Memory Accesses

被引:2
|
作者
Cho, Han [1 ]
Park, Jongsun [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Convolutional neural network; max-pool; activation compression; activation pruning; L2-norm;
D O I
10.1109/ISOCC56007.2022.10031452
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While neural network pruning can reduce the amount of data transfers, pruning techniques such as fine-grained pruning cannot be efficiently implemented as indexing overhead is high. In order to design a hardware friendly pruning technique, structured pruning, which removes groups of data together to minimize the indexing overhead, is highly required. In this paper, to reduce the overall number of main memory accesses when implementing convolutional neural network (CNN) accelerators, we propose a hardware friendly L2-norm based structured channel-wise activation pruning using max-pooling. Max-pooling is typically deployed in CNN to decrease the height and width of CNN activation maps. The simulation results of the proposed technique shows that the activation maps of ResNet20 and ResNet56 can be reduced to 48% and 51%, respectively, with less than 1% accuracy degradation on CIFAR-10.
引用
收藏
页码:71 / 72
页数:2
相关论文
共 36 条
  • [31] High-Speed Planning and Reducing Memory Usage of a Precomputed Search Tree Using Pruning
    Suzuki, Yumiko
    Thompson, Simon
    Kagami, Satoshi
    ADVANCED ROBOTICS, 2010, 24 (5-6) : 903 - 920
  • [32] Motor Imagery Classification Using Inter-Task Transfer Learning via a Channel-Wise Variational Autoencoder-Based Convolutional Neural Network
    Lee, Do-Yeun
    Jeong, Ji-Hoon
    Lee, Byeong-Hoo
    Lee, Seong-Whan
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2022, 30 : 226 - 237
  • [33] Reducing Impact of CNFET Process Imperfections on Shape of Activation Function by Using Connection Pruning and Approximate Neuron Circuit
    Sheikh, Kaship
    Wei, Lan
    PROCEEDINGS OF THE TWENTYFIRST INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2020), 2020, : 279 - 284
  • [34] An explainable machine learning based flashover prediction model using dimension-wise class activation map
    Fan, Linhao
    Tam, Wai Cheong
    Tong, Qi
    Fu, Eugene Yujun
    Liang, Tianshui
    FIRE SAFETY JOURNAL, 2023, 140
  • [35] Reducing On-chip Memory for Massive MIMO Baseband Processing using Channel Compression
    Liu, Yangxurui
    Edfors, Ove
    Liu, Liang
    Owal, Viktor
    2017 IEEE 86TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2017,
  • [36] C-MAN: A Multi-attention Approach for Precise Plant Species Classification and Disease Detection Using Multi-scale, Channel-Wise, and Cross-Modal Attentions
    Prasad, Pulicherla Siva
    Agniraj, Senthilrajan
    TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1231 - 1243