Optimizing Convolutional Neural Networks for Image Classification on Resource-Constrained Microcontroller Units

Cited by: 2
Authors
Brockmann, Susanne [1]
Schlippe, Tim [1]
Affiliations
[1] IU Int Univ Appl Sci, D-99084 Erfurt, Germany
Keywords
TinyML; image classification; microcontroller units
DOI
10.3390/computers13070173
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Running machine learning algorithms for image classification locally on small, cheap, and low-power microcontroller units (MCUs) has advantages in terms of bandwidth, inference time, energy, reliability, and privacy for different applications. Therefore, TinyML focuses on deploying neural networks on MCUs with random access memory sizes between 2 KB and 512 KB and read-only memory storage capacities between 32 KB and 2 MB. Models designed for high-end devices are usually ported to MCUs using model scaling factors provided by the model architecture's designers. However, our analysis shows that this naive approach of substantially scaling down convolutional neural networks (CNNs) for image classification using such default scaling factors results in suboptimal performance. Consequently, in this paper we present a systematic strategy for efficiently scaling down CNN model architectures to run on MCUs. Moreover, we present our CNN Analyzer, a dashboard-based tool for determining optimal CNN model architecture scaling factors for the downscaling strategy by gaining layer-wise insights into the model architecture scaling factors that drive model size, peak memory, and inference time. Using our strategy, we were able to introduce additional new model architecture scaling factors for MobileNet v1, MobileNet v2, MobileNet v3, and ShuffleNet v2 and to optimize these model architectures. Our best model variation outperforms the MobileNet v1 version provided in the MLPerf Tiny Benchmark on the Visual Wake Words image classification task, reducing the model size by 20.5% while increasing the accuracy by 4.0%.
Pages: 18
Related papers
50 in total
  • [31] NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks
    Lee, Eugene
    Lee, Chen-Yi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1475 - 1484
  • [32] Improving training datasets for resource-constrained speaker recognition neural networks
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2023, 2023, : 3167 - 3171
  • [33] Squeezing Accumulators in Binary Neural Networks for Extremely Resource-Constrained Applications
    Azamat, Azat
    Park, Jaewoo
    Lee, Jongeun
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022
  • [34] Optimizing Convolutional Neural Networks for low-resource devices
    Rusu, Cosmin-Ionut
    Czibula, Gabriela
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2018, : 245 - 252
  • [35] Optimizing convolutional neural networks architecture using a modified particle swarm optimization for image classification
    Elhani, D.
    Megherbi, A. C.
    Zitouni, A.
    Dornaika, F.
    Sbaa, S.
    Taleb-Ahmed, A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [36] Distill-Net: Application-Specific Distillation of Deep Convolutional Neural Networks for Resource-Constrained IoT Platforms
    Motamedi, Mohammad
    Portillo, Felix A.
    Fong, Daniel
    Ghiasi, Soheil
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
  • [37] A convolutional neural network for the resource-constrained project scheduling problem (RCPSP): A new approach
    Golab, Amir
    Gooya, Ehsan Sedgh
    Al Falou, Ayman
    Cabon, Mikael
    DECISION SCIENCE LETTERS, 2023, 12 (02) : 225 - 238
  • [38] A Runtime Switchable Multi-Phase Convolutional Neural Network for Resource-Constrained Systems
    Jang, Jeonggyu
    Yang, Hoeseok
    IEEE ACCESS, 2023, 11 : 62449 - 62461
  • [39] RAP: A Software Framework of Developing Convolutional Neural Networks for Resource-constrained Devices Using Environmental Monitoring as a Case Study
    Tu, Chia-Heng
    Sun, Qihui
    Chang, Hsiao-Hsuan
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2021, 5 (04)
  • [40] An automatic fire detection system based on deep convolutional neural networks for low-power, resource-constrained devices
    de Venancio, Pedro Vinicius A. B.
    Lisboa, Adriano C.
    Barbosa, Adriano V.
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18) : 15349 - 15368