Optimizing Convolutional Neural Networks for Image Classification on Resource-Constrained Microcontroller Units

Cited by: 2
Authors
Brockmann, Susanne [1]
Schlippe, Tim [1]
Affiliation
[1] IU Int Univ Appl Sci, D-99084 Erfurt, Germany
Keywords
TinyML; image classification; microcontroller units
DOI
10.3390/computers13070173
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Running machine learning algorithms for image classification locally on small, cheap, and low-power microcontroller units (MCUs) has advantages in terms of bandwidth, inference time, energy, reliability, and privacy for different applications. Therefore, TinyML focuses on deploying neural networks on MCUs with random access memory sizes between 2 KB and 512 KB and read-only memory storage capacities between 32 KB and 2 MB. Models designed for high-end devices are usually ported to MCUs using model scaling factors provided by the model architecture's designers. However, our analysis shows that this naive approach of substantially scaling down convolutional neural networks (CNNs) for image classification using such default scaling factors results in suboptimal performance. Consequently, in this paper we present a systematic strategy for efficiently scaling down CNN model architectures to run on MCUs. Moreover, we present our CNN Analyzer, a dashboard-based tool that supports the downscaling strategy: it provides layer-wise insights into the model architecture scaling factors that drive model size, peak memory, and inference time, which helps determine optimal scaling factors. Using our strategy, we introduced additional model architecture scaling factors for MobileNet v1, MobileNet v2, MobileNet v3, and ShuffleNet v2 and optimized these model architectures. Our best model variation outperforms the MobileNet v1 version provided in the MLPerf Tiny Benchmark on the Visual Wake Words image classification task, reducing the model size by 20.5% while increasing the accuracy by 4.0%.
Pages: 18