Scheduling Inputs in Early Exit Neural Networks

Cited by: 1

Authors:

Casale, Giuliano [1]

Roveri, Manuel [2]

Affiliations:

[1] Imperial Coll London, Dept Comp, London SW7 2AZ, England

[2] Politecn Milan, Dipartimento Elettron Informaz & Bioingegneria, I-27100 Milan, Italy

Source:

IEEE TRANSACTIONS ON COMPUTERS | 2024 / Vol. 73 / No. 02

Keywords:

Neural network; early exit; configuration; threshold; scheduling; dependability; POLICIES; M/G/1/K; SERVICE; QUEUE;

DOI:

10.1109/TC.2023.3333189

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Early exit neural networks (EENs) reduce the processing times of deep convolutional neural networks by means of internal classifiers (ICs) that allow jobs, being the input of the EEN, to exit early from the processing pipeline. However, the current designs used in pervasive systems ignore variability in data arrival rates, exposing EEN-based services to potential loss of the incoming jobs, due to finite input buffer capacity. Motivated by this issue, we introduce and study the early exit scheduling problem, which aims at dynamically configuring IC thresholds at runtime to achieve effective trade-offs between job classification accuracy, processing time, and job loss ratio. We argue that deciding the EEN exit layer for a job at the start of its processing makes the problem mathematically tractable, allowing us to develop policies to control buffer backlog, classification accuracy, and processing time across the EEN layers. The main contribution of the paper is the introduction of single-exit IC threshold configurations as a mechanism to allow the scheduling policy to reliably predict the best EEN exit layer of each input job. Three scheduling policies that leverage this idea are proposed to dynamically schedule job arrivals to an EEN-based service. The proposed solution, here tailored to EENs based on convolutional neural networks (CNNs), is fairly general and can be applied to different use cases. The two application scenarios considered in this paper focus on image classification and intrusion detection. Experiments on some popular CNNs for the two aforementioned application scenarios indicate that the proposed policies can achieve significant savings in processing times and improve job loss ratio compared to both ordinary EENs and CNNs while still providing high mean classification accuracy.
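As a rough illustration of the idea described in the abstract (deciding a job's exit layer at the start of its processing, so that buffer backlog can be traded against accuracy), the following Python sketch maps buffer occupancy to an exit layer. It is not one of the paper's three policies: the `ExitLayer` type, the `choose_exit` function, and all processing times and accuracies are hypothetical values chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExitLayer:
    name: str
    service_time: float  # hypothetical processing time up to this exit
    accuracy: float      # hypothetical mean accuracy at this exit


# Hypothetical exits, ordered from earliest (fastest) to final (most accurate).
EXITS = [
    ExitLayer("IC1", 1.0, 0.80),
    ExitLayer("IC2", 2.0, 0.88),
    ExitLayer("final", 4.0, 0.93),
]


def choose_exit(backlog: int, capacity: int, exits=EXITS) -> ExitLayer:
    """Pick the exit layer for a job before it starts processing.

    Assumed heuristic (not from the paper): the fuller the finite input
    buffer, the earlier the exit, to reduce the risk of job loss.
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    # Fraction of the buffer in use, clamped to [0, 1].
    fill = min(max(backlog / capacity, 0.0), 1.0)
    # Empty buffer -> last (most accurate) exit; full buffer -> first exit.
    idx = int((1.0 - fill) * (len(exits) - 1) + 0.5)
    return exits[idx]


if __name__ == "__main__":
    for backlog in (0, 5, 10):
        layer = choose_exit(backlog, capacity=10)
        print(backlog, layer.name, layer.service_time, layer.accuracy)
```

Because the exit is fixed before processing begins, the service time of each job is known in advance in this sketch, which is the property the abstract credits with making the scheduling problem tractable.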


Pages: 451 - 465

Number of pages: 15

50 related records in total

[1] EDANAS: Adaptive Neural Architecture Search for Early Exit Neural Networks
Gambella, Matteo
Roveri, Manuel
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[2] Early-Exit Neural Networks with Nested Prediction Sets
Jazbec, Metod
Forre, Patrick
Mandt, Stephan
Zhang, Dan
Nalisnick, Eric
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2024, 244 : 1780 - 1796
[3] Zero Time Waste: Recycling Predictions in Early Exit Neural Networks
Wolczyk, Maciej
Wojcik, Bartosz
Balazy, Klaudia
Podolak, Igor
Tabor, Jacek
Smieja, Marek
Trzcinski, Tomasz
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
[4] SEENN: Towards Temporal Spiking Early-Exit Neural Networks
Li, Yuhang
Geller, Tamar
Kim, Youngeun
Panda, Priyadarshini
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[5] Early-Exit with Class Exclusion for Efficient Inference of Neural Networks
Wang, Jingcun
Li, Bing
Zhang, Grace Li
2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 263 - 267
[6] SLEXNet: Adaptive Inference Using Slimmable Early Exit Neural Networks
Kutukcu, Basar
Baidya, Sabur
Dey, Sujit
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (06)
[7] Dynamic Early Exit Scheduling for Deep Neural Network Inference through Contextual Bandits
Ju, Weiyu
Bao, Wei
Ge, Liming
Yuan, Dong
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 823 - 832
[8] Neural Networks with Dependent Inputs
Mostafa Boskabadi
Mahdi Doostparast
Neural Processing Letters, 2023, 55 : 7337 - 7350
[9] Neural Networks with Dependent Inputs
Boskabadi, Mostafa
Doostparast, Mahdi
NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7337 - 7350
[10] Efficient Hardware Implementation of Cellular Neural Networks with Incremental Quantization and Early Exit
Xu, Xiaowei
Lu, Qing
Wang, Tianchen
Hu, Yu
Zhuo, Chen
Liu, Jinglan
Shi, Yiyu
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2018, 14 (04)
