Skipping CNN Convolutions Through Efficient Memoization

Cited by: 4
Authors
de Moura, Rafael Fao [1]
Santos, Paulo C. [1]
de Lima, Joao Paulo C. [1]
Alves, Marco A. Z. [2]
Beck, Antonio C. S. [1]
Carro, Luigi [1]
Affiliations
[1] Univ Fed Rio Grande do Sul, Informat Inst, Porto Alegre, RS, Brazil
[2] Univ Fed Parana, Dept Informat, Curitiba, Parana, Brazil
Keywords
Convolutional Neural Networks; Computation reuse; Memoization
DOI
10.1007/978-3-030-27562-4_5
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Convolutional Neural Networks (CNNs) have become a de facto standard for image and video recognition. However, current software and hardware implementations of convolutional operations still struggle to meet energy budget constraints because of the data-intensive processing that CNNs require. This paper proposes a software-based memoization technique that skips entire convolution calculations. We demonstrate that grouping output values into proximity-based clusters reduces the memory needed to store all the tables by hundreds of times. We also present a table mapping scheme that indexes the input set of each convolutional layer to its output value. Our experimental results show that, for a YOLOv3-tiny CNN, the technique achieves a speedup of up to 3.5x while reducing energy consumption to 22% of the baseline, with an accuracy loss of 7.4%.
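The general idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class name `MemoizedConv`, the uniform-grid quantization used for both the input key and the output clusters, and the parameters `in_step` and `out_step` are all assumptions chosen for illustration. The sketch only shows how a lookup table keyed on a coarsened input patch lets a repeated or nearby patch skip the multiply-accumulate, and how storing a cluster index instead of the exact output keeps each table entry small.

```python
from typing import Dict, Sequence, Tuple


def quantize(values: Sequence[float], step: float) -> Tuple[int, ...]:
    """Map each value to an integer bucket so that nearby values share a key."""
    return tuple(round(v / step) for v in values)


class MemoizedConv:
    """Hypothetical memoized dot product for one flattened convolution kernel."""

    def __init__(self, weights: Sequence[float], in_step: float, out_step: float):
        self.weights = list(weights)
        self.in_step = in_step    # assumed granularity of the input key
        self.out_step = out_step  # assumed width of an output cluster
        self.table: Dict[Tuple[int, ...], int] = {}
        self.hits = 0
        self.misses = 0

    def apply(self, patch: Sequence[float]) -> float:
        key = quantize(patch, self.in_step)
        if key in self.table:
            # Table hit: the multiply-accumulate is skipped entirely.
            self.hits += 1
        else:
            # Table miss: compute once, then store only the cluster index.
            self.misses += 1
            exact = sum(w * x for w, x in zip(self.weights, patch))
            self.table[key] = round(exact / self.out_step)
        # Return the representative value of the stored cluster.
        return self.table[key] * self.out_step


# A 2x2 kernel and two nearby 2x2 input patches, both flattened.
conv = MemoizedConv([0.5, -1.0, 0.25, 2.0], in_step=0.5, out_step=0.25)
first = conv.apply([1.0, 2.0, 3.0, 4.0])   # miss: computed and stored
second = conv.apply([1.1, 2.1, 2.9, 4.1])  # hit: same quantized key, reused
print(first, second, conv.hits, conv.misses)
```

The second patch differs slightly from the first, so reusing the stored result introduces a small error. That is the kind of accuracy-for-speed trade-off the abstract reports, although the actual clustering and table mapping schemes in the paper may differ substantially from this sketch.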
Pages: 65-76
Page count: 12