Memory-Throughput Trade-off for CNN-Based Applications at the Edge

Cited by: 4
Authors
Minakova, Svetlana [1]
Stefanov, Todor [1]
Affiliations
[1] Leiden Univ, Niels Bohrweg 1, NL-2333 CA Leiden, South Holland, Netherlands
Keywords
Convolutional neural networks; AI at the edge; memory reduction; trade-off
DOI
10.1145/3527457
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Many modern applications require execution of Convolutional Neural Networks (CNNs) on edge devices, such as mobile phones or embedded platforms. This can be challenging, because state-of-the-art CNNs are memory-intensive, whereas the memory budget of edge devices is highly limited. To address this challenge, a variety of CNN memory reduction methodologies have been proposed. Typically, the memory cost of a CNN is reduced using methodologies such as pruning and quantization, which reduce the number or precision of the CNN parameters. When more aggressive memory reduction is required, pruning and quantization can be combined with CNN memory reuse methodologies. The latter reuse device memory allocated for storing intermediate computational results of the CNN, thereby further reducing the CNN memory cost. However, the existing memory reuse methodologies are unfit for CNN-based applications that exploit pipeline parallelism available within the CNNs or use multiple CNNs to perform their functionality. In this article, we therefore propose a novel CNN memory reuse methodology. In our methodology, we significantly extend and combine two existing CNN memory reuse methodologies to offer efficient memory reuse for a wide range of CNN-based applications.
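As a rough illustration of what reusing memory for intermediate results can mean, the following minimal sketch compares two buffer budgets for a chain of layers. It is not the methodology proposed in the article: it assumes strictly sequential, layer-by-layer execution of a single CNN, which is exactly the setting the abstract says falls short for pipelined or multi-CNN applications. The function names and tensor sizes are hypothetical.

```python
def naive_buffer_bytes(tensor_bytes):
    """Budget when every intermediate tensor gets its own dedicated buffer."""
    return sum(tensor_bytes)


def reuse_buffer_bytes(tensor_bytes):
    """Budget when buffers are reused under sequential execution.

    Assumption: layers run one at a time in a simple chain, so only the
    input and the output of the currently running layer are live.
    """
    if len(tensor_bytes) < 2:
        return sum(tensor_bytes)
    # Peak memory is the largest (input + output) pair along the chain.
    return max(a + b for a, b in zip(tensor_bytes, tensor_bytes[1:]))


# Hypothetical sizes (in bytes) of the input and of each layer's output.
tensor_bytes = [150_528, 802_816, 401_408, 200_704, 4_000]

naive = naive_buffer_bytes(tensor_bytes)
reused = reuse_buffer_bytes(tensor_bytes)
print(f"dedicated buffers: {naive} bytes, with reuse: {reused} bytes")
```

Under pipeline parallelism several layers run at the same time, so more tensors are live at once and this simple pairwise bound no longer applies.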
Pages: 26