Optimizing Convolutional Neural Networks for Image Classification on Resource-Constrained Microcontroller Units

Cited by: 2

Authors:

Brockmann, Susanne [1]

Schlippe, Tim [1]

Affiliations:

[1] IU Int Univ Appl Sci, D-99084 Erfurt, Germany

Source:

COMPUTERS | 2024 / Volume 13 / Issue 07

Keywords:

TinyML; image classification; microcontroller units;

DOI:

10.3390/computers13070173

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification code:

081203 ; 0835 ;

Abstract:

Running machine learning algorithms for image classification locally on small, cheap, and low-power microcontroller units (MCUs) has advantages in terms of bandwidth, inference time, energy, reliability, and privacy for different applications. Therefore, TinyML focuses on deploying neural networks on MCUs with random access memory sizes between 2 KB and 512 KB and read-only memory storage capacities between 32 KB and 2 MB. Models designed for high-end devices are usually ported to MCUs using model scaling factors provided by the model architecture's designers. However, our analysis shows that this naive approach of substantially scaling down convolutional neural networks (CNNs) for image classification using such default scaling factors results in suboptimal performance. Consequently, in this paper we present a systematic strategy for efficiently scaling down CNN model architectures to run on MCUs. Moreover, we present our CNN Analyzer, a dashboard-based tool for determining optimal CNN model architecture scaling factors for the downscaling strategy by gaining layer-wise insights into the model architecture scaling factors that drive model size, peak memory, and inference time. Using our strategy, we were able to introduce additional new model architecture scaling factors for MobileNet v1, MobileNet v2, MobileNet v3, and ShuffleNet v2 and to optimize these model architectures. Our best model variation outperforms the MobileNet v1 version provided in the MLPerf Tiny Benchmark on the Visual Wake Words image classification task, reducing the model size by 20.5% while increasing the accuracy by 4.0%.
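As a rough illustration of why channel scaling factors drive model size, the sketch below counts the weights of a single depthwise separable convolution block (the building block of MobileNet v1) under a width multiplier and compares the result with the 2 MB upper ROM bound mentioned in the abstract. It is not the authors' CNN Analyzer or their scaling strategy. The function names, the example channel counts (256 and 512), the simple rounding of scaled channels, and the assumption of one byte per weight (int8 quantization) are all illustrative choices; biases and batch normalization parameters are ignored.

```python
# Hypothetical helper names; this is a back-of-the-envelope sketch only.

ROM_BUDGET_BYTES = 2 * 1024 * 1024  # upper ROM bound quoted in the abstract (2 MB)


def scaled_channels(channels: int, alpha: float) -> int:
    """Scale a channel count by the width multiplier alpha (simple rounding, assumed)."""
    return max(1, int(round(channels * alpha)))


def separable_conv_params(c_in: int, c_out: int, kernel: int = 3) -> int:
    """Weights of a depthwise (kernel x kernel) plus pointwise (1 x 1) convolution."""
    depthwise = kernel * kernel * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


def block_weight_bytes(c_in: int, c_out: int, alpha: float,
                       bytes_per_weight: int = 1) -> int:
    """Storage for one scaled block, assuming int8 weights by default."""
    params = separable_conv_params(scaled_channels(c_in, alpha),
                                   scaled_channels(c_out, alpha))
    return params * bytes_per_weight


if __name__ == "__main__":
    for alpha in (1.0, 0.5, 0.25):
        size = block_weight_bytes(256, 512, alpha)
        print(f"alpha={alpha}: {size} bytes, fits in ROM: {size <= ROM_BUDGET_BYTES}")
```

Because the pointwise term scales with the product of input and output channels, halving the width multiplier cuts this block's weight count to roughly a quarter, which is why a single global factor can shrink some layers far more than others.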

Pages: 18

