AdaEE: Adaptive Early-Exit DNN Inference Through Multi-Armed Bandits

被引:1
|
作者
Pacheco, Roberto G. [1 ]
Shifrin, Mark [2 ]
Couto, Rodrigo S. [1 ]
Menasche, Daniel S. [1 ]
Hanawal, Manjesh K. [3 ]
Campista, Miguel Elias M. [1 ]
机构
[1] Univ Fed Rio de Janeiro, Rio De Janeiro, Brazil
[2] Ben Gurion Univ Negev, Beer Sheva, Israel
[3] Indian Inst Technol, Bombay, Maharashtra, India
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1109/ICC45041.2023.10279243
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Deep Neural Networks (DNNs) are widely used to solve a growing number of tasks, such as image classification. However, their deployment at resource-constrained devices still poses challenges related to energy consumption and delay overheads. Early-Exit DNNs (EE-DNNs) address the challenges by adding side branches through their architecture. Under an edge-cloud co-inference, if the confidence at a side branch is larger than a fixed confidence threshold, the inference is performed completely at the edge device, saving computation for more difficult observations. Otherwise, the edge device offloads the inference task to the cloud, incurring overhead. Despite its success, EE-DNNs for image classification have to cope with distorted images. The baseline distortion level depends on the environmental context, e.g., time of the day, lighting, and weather conditions. To cope with varying distortion, we propose Adaptive Early-Exit in Deep Neural Networks (AdaEE), a novel algorithm to dynamically adjust the confidence threshold based on context, leveraging the Upper Confidence Bound (UCB) for that matter. AdaEE provably achieves logarithmic regret under mild conditions. We experimentally verify that 1) convergence occurs after collecting a few thousand observations for images with different distortion levels and overhead values, and 2) AdaEE obtains a lower cumulative regret when compared against alternatives using the Caltech-256 dataset subject to varying distortion.
引用
收藏
页码:3726 / 3731
页数:6
相关论文
共 50 条
  • [1] Online Multi-Armed Bandits with Adaptive Inference
    Dimakopoulou, Maria
    Ren, Zhimei
    Zhou, Zhengyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [2] Learning Early Exit for Deep Neural Network Inference on Mobile Devices through Multi-Armed Bandits
    Ju, Weiyu
    Bao, Wei
    Yuan, Dong
    Ge, Liming
    Zhou, Bing Bing
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 11 - 20
  • [3] Multi-Armed Bandits for Adaptive Constraint Propagation
    Balafrej, Amine
    Bessiere, Christian
    Paparrizou, Anastasia
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 290 - 296
  • [4] An empirical evaluation of active inference in multi-armed bandits
    Markovic, Dimitrije
    Stojic, Hrvoje
    Schwoebel, Sarah
    Kiebel, Stefan J.
    NEURAL NETWORKS, 2021, 144 : 229 - 246
  • [5] Adaptive Data Depth via Multi-Armed Bandits
    Baharav, Tavor Z.
    Lai, Tze Leung
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [6] Empowering Adaptive Early-Exit Inference with Latency Awareness
    Tan, Xinrui
    Li, Hongjia
    Wang, Liming
    Huang, Xueqing
    Xu, Zhen
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9825 - 9833
  • [7] On Kernelized Multi-armed Bandits
    Chowdhury, Sayak Ray
    Gopalan, Aditya
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [8] Multi-armed Bandits with Compensation
    Wang, Siwei
    Huang, Longbo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [9] Regional Multi-Armed Bandits
    Wang, Zhiyang
    Zhou, Ruida
    Shen, Cong
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [10] Federated Multi-Armed Bandits
    Shi, Chengshuai
    Shen, Cong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9603 - 9611