Memory-Throughput Trade-off for CNN-Based Applications at the Edge

Cited by: 4
Authors
Minakova, Svetlana [1]
Stefanov, Todor [1]
Affiliations
[1] Leiden Univ, Niels Bohrweg 1, NL-2333 CA Leiden, South Holland, Netherlands
Keywords
Convolutional neural networks; AI at the edge; memory reduction; trade-off
DOI
10.1145/3527457
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Many modern applications require execution of Convolutional Neural Networks (CNNs) on edge devices, such as mobile phones or embedded platforms. This can be challenging, as state-of-the-art CNNs are memory costly, whereas the memory budget of edge devices is highly limited. To address this challenge, a variety of CNN memory reduction methodologies have been proposed. Typically, the memory of a CNN is reduced using methodologies such as pruning and quantization. These methodologies reduce the number or precision of CNN parameters, thereby reducing the CNN memory cost. When more aggressive CNN memory reduction is required, the pruning and quantization methodologies can be combined with CNN memory reuse methodologies. The latter methodologies reuse device memory allocated for storage of CNN intermediate computational results, thereby further reducing the CNN memory cost. However, the existing memory reuse methodologies are unfit for CNN-based applications that exploit pipeline parallelism available within the CNNs or use multiple CNNs to perform their functionality. In this article, we therefore propose a novel CNN memory reuse methodology. In our methodology, we significantly extend and combine two existing CNN memory reuse methodologies to offer efficient memory reuse for a wide range of CNN-based applications.
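To make the idea of reusing memory for intermediate results concrete, here is a minimal sketch. It is not the methodology proposed in the article. It assumes a strictly sequential chain of layers in which each intermediate tensor is consumed only by the next layer, so two alternating ("ping-pong") buffers can hold all intermediate results. The function names and the tensor sizes are hypothetical and chosen only for illustration.

```python
def naive_memory(sizes):
    """Memory needed when every intermediate tensor gets its own buffer."""
    return sum(sizes)


def pingpong_memory(sizes):
    """Memory needed when intermediate tensors share two alternating buffers.

    Assumption: layers run one after another, and tensor i is read only by
    the layer that produces tensor i + 1. Tensors at even positions then
    share one buffer and tensors at odd positions share the other, so each
    buffer only has to fit the largest tensor assigned to it.
    """
    even_buffer = max(sizes[0::2], default=0)
    odd_buffer = max(sizes[1::2], default=0)
    return even_buffer + odd_buffer


# Hypothetical sizes (in bytes) of the intermediate tensors of a 4-layer chain.
sizes = [400, 300, 200, 100]
print(naive_memory(sizes))     # 1000
print(pingpong_memory(sizes))  # 700
```

The sequential-execution assumption is exactly what stops holding when layers run in a pipeline or when several CNNs run side by side, since more than two intermediate tensors may then be live at once. The abstract names these cases as the ones where existing reuse methodologies are unfit.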
Pages: 26
Related papers
50 in total
  • [41] A Trade-off Between Delay and Throughput in IMS Network Session Setup
    Oveis, Amir Hosein
    Asadi, Mohammad Ali Shakeri
    Homami, Seyed Mostafa Safavi
    2014 22nd Iranian Conference on Electrical Engineering (ICEE), 2014, : 1614 - 1618
  • [42] An Expandable Hardware Platform for Implementation of CNN-Based Applications
    Javier Martinez-Alvarez, J.
    Javier Garrigos-Guerrero, F.
    Javier Toledo-Moreo, F.
    Manuel Ferrandez-Vicente, J.
    NEW CHALLENGES ON BIOINSPIRED APPLICATIONS: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART II, 2011, 6687 : 195 - 204
  • [43] A Blockchain Consensus Protocol Based on Dedicated Time-Memory-Data Trade-Off
    Mihaljevic, Miodrag J.
    IEEE ACCESS, 2020, 8 (08): : 141258 - 141268
  • [44] Solving the Memory-based Memoryless Trade-off Problem for EEG Signal Classification
    Park, Jungbae
    Lee, Sang Wan
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 505 - 510
  • [45] Reservoir computing decoupling memory-nonlinearity trade-off
    Xia, Ji
    Chu, Junyu
    Leng, Siyang
    Ma, Huanfei
    CHAOS, 2023, 33 (11)
  • [46] Reservoir Computing Beyond Memory-Nonlinearity Trade-off
    Masanobu Inubushi
    Kazuyuki Yoshimura
    Scientific Reports, 7
  • [47] TRADE-OFF BETWEEN MEMORY FOR VERBAL ITEMS AND THEIR VISUAL ATTRIBUTES
    LIGHT, LL
    BERGER, DE
    BARDALES, M
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN LEARNING AND MEMORY, 1975, 104 (02): : 188 - 193
  • [49] Trade-off between Capacity and Precision in Visuospatial Working Memory
    Roggeman, Chantal
    Klingberg, Torkel
    Feenstra, Heleen E. M.
    Compte, Albert
    Almeida, Rita
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2014, 26 (02) : 211 - 222
  • [50] Reservoir Computing Beyond Memory-Nonlinearity Trade-off
    Inubushi, Masanobu
    Yoshimura, Kazuyuki
    SCIENTIFIC REPORTS, 2017, 7