Skipping CNN Convolutions Through Efficient Memoization

被引:4
|
作者
de Moura, Rafael Fao [1 ]
Santos, Paulo C. [1 ]
de Lima, Joao Paulo C. [1 ]
Alves, Marco A. Z. [2 ]
Beck, Antonio C. S. [1 ]
Carro, Luigi [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Informat Inst, Porto Alegre, RS, Brazil
[2] Univ Fed Parana, Dept Informat, Curitiba, Parana, Brazil
关键词
Convolutional Neural Networks; Computation reuse; Memoization;
D O I
10.1007/978-3-030-27562-4_5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) have become a de-facto standard for image and video recognition. However, current software and hardware implementations targeting convolutional operations still lack embracing energy budget constraints due to the CNN intensive data processing behavior. This paper proposes a software-based memoization technique to skip entire convolution calculations. We demonstrate that, by grouping output values within proximity-based clusters, it is possible to reduce by hundreds of times the amount of memory necessary to store all the tables. Also, we present a table mapping scheme to index the input set of each convolutional layer to its output value. Our experimental results show that for a YOLOv3-tiny CNN, it is possible to achieve a speedup up to 3.5x while reducing the energy consumption to 22% of the baseline with an accuracy loss of 7.4%.
引用
收藏
页码:65 / 76
页数:12
相关论文
共 50 条
  • [1] Efficient FPGA design for Convolutions in CNN based on FFT-pruning
    He, Liulu
    Xie, Xiaoru
    Lin, Jun
    Wang, Zhongfeng
    APCCAS 2020: PROCEEDINGS OF THE 2020 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2020), 2020, : 27 - 30
  • [2] Synthesizing Efficient Memoization Algorithms
    Sun, Yican
    Peng, Xuanyu
    Xiong, Yingfei
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2023, 7 (OOPSLA):
  • [3] A Unified Hardware Architecture for Convolutions and Deconvolutions in CNN
    Bai, Lin
    Lyu, Yecheng
    Huang, Xinming
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [4] Nonlinear CNN: improving CNNs with quadratic convolutions
    Jiang, Yiyang
    Yang, Fan
    Zhu, Hengliang
    Zhou, Dian
    Zeng, Xuan
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8507 - 8516
  • [5] Energy Efficient FFT Implementation through Stage Skipping and Merging
    Sharma, Namita
    Panda, Preeti Ranjan
    Catthoor, Francky
    2015 INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE CODESIGN AND SYSTEM SYNTHESIS (CODES+ISSS), 2015, : 153 - 162
  • [6] Nonlinear CNN: improving CNNs with quadratic convolutions
    Yiyang Jiang
    Fan Yang
    Hengliang Zhu
    Dian Zhou
    Xuan Zeng
    Neural Computing and Applications, 2020, 32 : 8507 - 8516
  • [7] A Power-Efficient CNN Accelerator With Similar Feature Skipping for Face Recognition in Mobile Devices
    Kim, Sangyeob
    Lee, Juhyoung
    Kang, Sanghoon
    Lee, Jinsu
    Yoo, Hoi-Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (04) : 1181 - 1193
  • [8] Efficient Zero-Activation-Skipping for On-Chip Low-Energy CNN Acceleration
    Liu, Min
    He, Yifan
    Jiao, Hailong
    2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [9] An Efficient Composition of Bidirectional Programs by Memoization and Lazy Update
    Tsushima, Kanae
    Trong, Bach Nguyen
    Gluck, Robert
    Hu, Zhenjiang
    FUNCTIONAL AND LOGIC PROGRAMMING, FLOPS 2020, 2020, 12073 : 159 - 178
  • [10] Efficient Parallel Associative Classification Based on Rules Memoization
    Pires, Michel
    Silva, Nicollas
    Rocha, Leonardo
    Meira, Wagner
    Ferreira, Renato
    COMPUTATIONAL SCIENCE - ICCS 2019, PT IV, 2019, 11539 : 31 - 44