Scheduling Inputs in Early Exit Neural Networks

被引:1
|
作者
Casale, Giuliano [1 ]
Roveri, Manuel [2 ]
机构
[1] Imperial Coll London, Dept Comp, London SW7 2AZ, England
[2] Politecn Milan, Dipartimento Elettron Informaz & Bioingengeria, I-27100 Milan, Italy
关键词
Neural network; early exit; configuration; threshold; scheduling; dependability; POLICIES; M/G/1/K; SERVICE; QUEUE;
D O I
10.1109/TC.2023.3333189
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Early exit neural networks (EENs) reduce the processing times of deep convolutional neural networks by means of internal classifiers (ICs) that allow jobs, being the input of the EEN, to exit early from the processing pipeline. However, the current designs used in pervasive systems ignore variability in data arrival rates, exposing EEN-based services to potential loss of the incoming jobs, due to finite input buffer capacity. Motivated by this issue, we introduce and study the early exit scheduling problem, which aims at dynamically configuring IC thresholds at runtime to achieve effective trade-offs between job classification accuracy, processing time, and job loss ratio. We argue that deciding the EEN exit layer for a job at the start of its processing makes the problem mathematically tractable, allowing us to develop policies to control buffer backlog, classification accuracy, and processing time across the EEN layers. The main contribution of the paper is the introduction of single-exit IC threshold configurations as a mechanism to allow the scheduling policy to reliably predict the best EEN exit layer of each input job. Three scheduling policies that leverage this idea are proposed to dynamically schedule job arrivals to an EEN-based service. The proposed solution, here tailored to EENs based on convolutional neural networks (CNNs), is fairly general and can be applied to different use cases. The two application scenarios considered in this paper focus on image classification and intrusion detection. Experiments on some popular CNNs for the two aforementioned application scenarios indicate that the proposed policies can achieve significant savings in processing times and improve job loss ratio compared to both ordinary EENs and CNNs while still providing high mean classification accuracy.
引用
收藏
页码:451 / 465
页数:15
相关论文
共 50 条
  • [1] EDANAS: Adaptive Neural Architecture Search for Early Exit Neural Networks
    Gambella, Matteo
    Roveri, Manuel
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [2] Early-Exit Neural Networks with Nested Prediction Sets
    Jazbec, Metod
    Forre, Patrick
    Mandt, Stephan
    Zhang, Dan
    Nalisnick, Eric
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2024, 244 : 1780 - 1796
  • [3] Zero Time Waste: Recycling Predictions in Early Exit Neural Networks
    Wolczyk, Maciej
    Wojcik, Bartosz
    Balazy, Klaudia
    Podolak, Igor
    Tabor, Jacek
    Smieja, Marek
    Trzcinski, Tomasz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [4] SEENN: Towards Temporal Spiking Early-Exit Neural Networks
    Li, Yuhang
    Geller, Tamar
    Kim, Youngeun
    Panda, Priyadarshini
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Early-Exit with Class Exclusion for Efficient Inference of Neural Networks
    Wang, Jingcun
    Li, Bing
    Zhang, Grace Li
    2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 263 - 267
  • [6] SLEXNet: Adaptive Inference Using Slimmable Early Exit Neural Networks
    Kutukcu, Basar
    Baidya, Sabur
    Dey, Sujit
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (06)
  • [7] Dynamic Early Exit Scheduling for Deep Neural Network Inference through Contextual Bandits
    Ju, Weiyu
    Bao, Wei
    Ge, Liming
    Yuan, Dong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 823 - 832
  • [8] Neural Networks with Dependent Inputs
    Mostafa Boskabadi
    Mahdi Doostparast
    Neural Processing Letters, 2023, 55 : 7337 - 7350
  • [9] Neural Networks with Dependent Inputs
    Boskabadi, Mostafa
    Doostparast, Mahdi
    NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7337 - 7350
  • [10] Efficient Hardware Implementation of Cellular Neural Networks with Incremental Quantization and Early Exit
    Xu, Xiaowei
    Lu, Qing
    Wang, Tianchen
    Hu, Yu
    Zhuo, Chen
    Liu, Jinglan
    Shi, Yiyu
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2018, 14 (04)