CNN Inference Accelerators with Adjustable Feature Map Compression Ratios

Authors
Tsai, Yu-Chih [1]
Liu, Chung-Yueh [1]
Wang, Chia-Chun [1]
Hsu, Tsen-Wei [1]
Liu, Ren-Shuo [1]
Affiliations
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu, Taiwan
Keywords
CNN; Inference-Time Adjustability; Feature Map Compression; Memory Bandwidth; Hardware Accelerator;
DOI
10.1109/ICCD58817.2023.00099
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Recently, there has been increasing interest in developing convolutional neural networks (CNNs) with adjustable configurations that can adapt instantly to different resource constraints during inference. At run time, the trained CNN can switch between modes to reach a desired accuracy-energy trade-off point, much as CPUs do with the widely adopted DVFS (dynamic voltage and frequency scaling) and turbo boost. In this paper, we propose strategies that give CNN inference accelerators an adjustable feature map compression ratio, making their external memory access volume tunable. We use the mature JPEG technique to compress the intermediate feature maps. The critical challenge is to support these adjustable compression ratios with a single CNN rather than with multiple CNNs, one per ratio. In response, we propose compression-aware joint training and switchable batch normalization (BN). We demonstrate our design with ResNet18, ResNet50, and MobileNetV2 on ImageNet, achieving inference-time compression ratio adjustability and reducing external memory access bandwidth requirements. The results show that the proposed strategies maintain Top-1 accuracy and reduce external memory access by up to 22.7x to 28.3x using only a single CNN model with sets of BN parameters corresponding to the multiple compression ratios.
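The abstract names switchable batch normalization without giving implementation details, so the following is only a minimal sketch of the general idea: one set of shared weights, with a separate set of BN parameters selected per compression mode at inference time. The class name `SwitchableBatchNorm`, the mode names, all parameter values, and the `lossy_roundtrip` helper are hypothetical, and uniform quantization is used here purely as a stand-in for JPEG compression of a feature map.

```python
import math


def lossy_roundtrip(values, step):
    """Stand-in for a lossy compress/decompress pass (NOT JPEG).

    Uniform quantization with a hypothetical step size: a larger step
    mimics a higher compression ratio and distorts the features more.
    """
    return [round(v / step) * step for v in values]


class SwitchableBatchNorm:
    """Inference-only batch norm with one parameter set per compression mode.

    params_by_mode maps a mode name to (gamma, beta, running_mean, running_var).
    All names and values are illustrative assumptions, not the paper's.
    """

    def __init__(self, params_by_mode, eps=1e-5):
        if not params_by_mode:
            raise ValueError("at least one mode is required")
        self.params_by_mode = dict(params_by_mode)
        self.eps = eps
        # Default to the first mode supplied.
        self.mode = next(iter(self.params_by_mode))

    def set_mode(self, mode):
        """Select which BN parameter set subsequent calls will use."""
        if mode not in self.params_by_mode:
            raise KeyError(f"unknown compression mode: {mode!r}")
        self.mode = mode

    def __call__(self, values):
        gamma, beta, mean, var = self.params_by_mode[self.mode]
        scale = gamma / math.sqrt(var + self.eps)
        return [scale * (v - mean) + beta for v in values]


if __name__ == "__main__":
    # Hypothetical statistics for two compression modes of the same layer.
    bn = SwitchableBatchNorm({
        "low_ratio": (1.0, 0.0, 0.0, 1.0),
        "high_ratio": (2.0, 1.0, 0.5, 4.0),
    })
    features = [0.2, 1.3, -0.7]

    # The step sizes are assumptions chosen only to make the modes differ.
    for mode, step in (("low_ratio", 0.1), ("high_ratio", 0.5)):
        bn.set_mode(mode)
        print(mode, bn(lossy_roundtrip(features, step)))
```

Only the small BN parameter sets differ between modes in this sketch; the convolution weights would be shared, which matches the abstract's description of a single CNN model with multiple sets of BN parameters.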
Pages: 631-634
Number of pages: 4