Scheduling Inputs in Early Exit Neural Networks

Authors
Casale, Giuliano [1]
Roveri, Manuel [2]
Affiliations
[1] Imperial Coll London, Dept Comp, London SW7 2AZ, England
[2] Politecn Milan, Dipartimento Elettron Informaz & Bioingegneria, I-27100 Milan, Italy
Keywords
Neural network; early exit; configuration; threshold; scheduling; dependability; POLICIES; M/G/1/K; SERVICE; QUEUE
DOI
10.1109/TC.2023.3333189
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Early exit neural networks (EENs) reduce the processing times of deep convolutional neural networks by means of internal classifiers (ICs) that allow jobs, being the input of the EEN, to exit early from the processing pipeline. However, the current designs used in pervasive systems ignore variability in data arrival rates, exposing EEN-based services to potential loss of the incoming jobs, due to finite input buffer capacity. Motivated by this issue, we introduce and study the early exit scheduling problem, which aims at dynamically configuring IC thresholds at runtime to achieve effective trade-offs between job classification accuracy, processing time, and job loss ratio. We argue that deciding the EEN exit layer for a job at the start of its processing makes the problem mathematically tractable, allowing us to develop policies to control buffer backlog, classification accuracy, and processing time across the EEN layers. The main contribution of the paper is the introduction of single-exit IC threshold configurations as a mechanism to allow the scheduling policy to reliably predict the best EEN exit layer of each input job. Three scheduling policies that leverage this idea are proposed to dynamically schedule job arrivals to an EEN-based service. The proposed solution, here tailored to EENs based on convolutional neural networks (CNNs), is fairly general and can be applied to different use cases. The two application scenarios considered in this paper focus on image classification and intrusion detection. Experiments on some popular CNNs for the two aforementioned application scenarios indicate that the proposed policies can achieve significant savings in processing times and improve job loss ratio compared to both ordinary EENs and CNNs while still providing high mean classification accuracy.
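The single-exit idea in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the function names, the rule "exit when confidence >= threshold", and the linear backlog rule are all assumptions made here for illustration only. The sketch shows how a threshold vector can pin a job to one exit chosen before processing starts, and how a scheduler might pick an earlier exit as the finite input buffer fills.

```python
from typing import List


def single_exit_thresholds(num_exits: int, exit_index: int) -> List[float]:
    """Thresholds that force a job to leave at exactly one exit.

    Assumption (not from the source): an internal classifier lets a job
    exit when its confidence, a value in [0, 1], is >= its threshold.
    A threshold of 0.0 therefore always fires and an infinite one never does.
    """
    if not 0 <= exit_index < num_exits:
        raise ValueError("exit_index out of range")
    return [0.0 if i == exit_index else float("inf") for i in range(num_exits)]


def choose_exit(backlog: int, capacity: int, num_exits: int) -> int:
    """Hypothetical backlog rule: the fuller the buffer, the earlier the exit."""
    if capacity <= 0 or num_exits <= 0:
        raise ValueError("capacity and num_exits must be positive")
    free_fraction = 1.0 - min(max(backlog, 0), capacity) / capacity
    # Map the free fraction of the buffer onto the exit indices 0..num_exits-1.
    return min(num_exits - 1, int(free_fraction * num_exits))


if __name__ == "__main__":
    exit_index = choose_exit(backlog=5, capacity=10, num_exits=4)
    print(exit_index, single_exit_thresholds(4, exit_index))
```

Because the exit is fixed when the job is admitted, its processing time is known up front under these assumptions, which is what makes backlog and loss easier to reason about than with confidence-driven exits.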
Pages: 451-465
Number of pages: 15