Memory-Throughput Trade-off for CNN-Based Applications at the Edge

被引:4
|
作者
Minakova, Svetlana [1 ]
Stefanov, Todor [1 ]
机构
[1] Leiden Univ, Niels Bohrweg 1, NL-2333 CA Leiden, South Holland, Netherlands
关键词
Convolutional neural networks; AI at the edge; memory reduction; trade-off;
D O I
10.1145/3527457
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many modern applications require execution of Convolutional Neural Networks (CNNs) on edge devices, such as mobile phones or embedded platforms. This can be challenging, as the state-of-the art CNNs are memory costly, whereas the memory budget of edge devices is highly limited. To address this challenge, a variety of CNN memory reduction methodologies have been proposed. Typically, the memory of a CNN is reduced using methodologies such as pruning and quantization. These methodologies reduce the number or precision of CNN parameters, thereby reducing the CNN memory cost. When more aggressive CNN memory reduction is required, the pruning and quantization methodologies can be combined with CNN memory reuse methodologies. The latter methodologies reuse device memory allocated for storage of CNN intermediate computational results, thereby further reducing the CNN memory cost. However, the existing memory reuse methodologies are unfit for CNN-based applications that exploit pipeline parallelism available within the CNNs or use multiple CNNs to perform their functionality. In this article, we therefore propose a novel CNN memory reuse methodology. In our methodology, we significantly extend and combine two existing CNN memory reuse methodologies to offer efficient memory reuse for a wide range of CNN-based applications.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Approximate Acceleration for CNN-based Applications on IoT Edge Devices
    Castro-Godinez, Jorge
    Hernandez-Araya, Deykel
    Shafique, Muhammad
    Henkel, Joerg
    2020 IEEE 11TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS & SYSTEMS (LASCAS), 2020,
  • [2] Improving latency performance trade-off in keyword spotting applications at the edge
    Paissan, Francesco
    Sahabdeen, Anisha Mohamed
    Ancilotto, Alberto
    Farella, Elisabetta
    2023 9TH INTERNATIONAL WORKSHOP ON ADVANCES IN SENSORS AND INTERFACES, IWASI, 2023, : 299 - 304
  • [3] Throughput-Memory Footprint Trade-Off in Synthesis of Streaming Software on Embedded Multiprocessors
    Hashemi, Matin
    Foroozannejad, Mohammad H.
    Ghiasi, Soheil
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 13 (03)
  • [4] A Comprehensive analysis of Deployment Optimization Methods for CNN-Based Applications on Edge Devices
    Li, Qi
    Su, Zhenling
    Meng, Lin
    Elektrotehniski Vestnik/Electrotechnical Review, 2024, 91 (03): : 95 - 107
  • [5] Scenario Based Run-Time Switching for Adaptive CNN-Based Applications at the Edge
    Minakova, Svetlana
    Sapra, Dolly
    Stefanov, Todor
    Pimentel, Andy D.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (02)
  • [6] A Comprehensive analysis of Deployment Optimization Methods for CNN-Based Applications on Edge Devices
    Li, Qi
    Su, Zhenling
    Meng, Lin
    ELEKTROTEHNISKI VESTNIK, 2024, 91 (03): : 95 - 107
  • [7] CNN-based edge filtering for object proposals
    Waris, Muhammad Adeel
    Iosifidis, Alexandros
    Gabbouj, Moncef
    NEUROCOMPUTING, 2017, 266 : 631 - 640
  • [8] Throughput-delay trade-off in wireless networks
    El Gamal, A
    Mammen, J
    Prabhakar, B
    Shah, D
    IEEE INFOCOM 2004: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-4, PROCEEDINGS, 2004, : 464 - 475
  • [9] Delay and Throughput Trade-Off in WiMAX Mesh Networks
    Bastani, Saeed
    Yousefi, Saleh
    Mazoochi, Mojtaba
    Ghiamatyoun, Alireza
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS, 2009, : 283 - +
  • [10] On the Throughput-Delay Trade-off in Georouting Networks
    Jacquet, Philippe
    Malik, Salman
    Mans, Bernard
    Silva, Alonso
    2012 PROCEEDINGS IEEE INFOCOM, 2012, : 765 - 773