Joint Exit Selection and Offloading Decision for Applications Based on Deep Neural Networks

被引:0
|
作者
Narmeen, Ramsha [1 ]
Mach, Pavel [1 ]
Becvar, Zdenek [1 ]
Ahmad, Ishtiaq [1 ]
机构
[1] Czech Tech Univ, Fac Elect Engn, Prague 16627, Czech Republic
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 23期
关键词
Delays; Task analysis; Servers; Artificial neural networks; Computer architecture; Accuracy; Energy consumption; Deep neural network (DNN); delay; edge computing; energy; exit selection; offloading; RESOURCE-ALLOCATION; POWER-CONTROL; EDGE; INTERNET;
D O I
10.1109/JIOT.2024.3444898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
User applications based on the deep neural networks (DNNs), such as object or anomaly detection, image recognition, or language processing, running on computation- and energy-constrained user equipment (UE) can be partially or fully processed in the edge computing servers to reduce a processing time and save an energy in the UE. To further reduce the processing time and the UE's energy consumption, DNN with multiple exit points can be incorporated. In this article, we address the problem of the decision on whether the computation should be offloaded from the UE to the edge computing server or processed locally by the UE and we solve this problem jointly and "on-the-fly" together with DNN exit selection. Since the formulated problem is very complex, we exploit the deep deterministic policy gradient for the exit selection and the offloading decisions (labeled DDPG-EOD) for the DNN-based applications. To this end, we first convert the problem into the Markov decision process, and then, we employ an end-to-end learning via DDPG with the actor-critic architecture. Second, we use a knowledge distillation-based technique to efficiently select the DNN's exit to minimize the delay and energy consumption. Simulation results show that the proposal is highly scalable, converges very quickly, and surpasses the best performing state-of-the-art approach by up to 120% and 100% in terms of the overall DNN processing delay and the energy consumption, respectively.
引用
收藏
页码:38098 / 38112
页数:15
相关论文
共 50 条
  • [1] Early-exit deep neural networks for distorted images: providing an efficient edge offloading
    Pacheco, Roberto G.
    Oliveira, Fernanda D. V. R.
    Couto, Rodrigo S.
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [2] Feature Selection for Deep Neural Networks in Cyber Security Applications
    Davis, Alexander
    Gill, Sumanjit
    Wong, Robert
    Tayeb, Shahab
    2020 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS 2020), 2020, : 82 - 88
  • [3] Deep Neural Networks meet computation offloading in mobile edge networks: Applications, taxonomy, and open issues
    Mustafa, Ehzaz
    Shuja, Junaid
    Rehman, Faisal
    Riaz, Ahsan
    Maray, Mohammed
    Bilal, Muhammad
    Khan, Muhammad Khurram
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2024, 226
  • [4] Interpretation of Deep Neural Networks Based on Decision Trees
    Ueno, Tsukasa
    Zhao, Qiangfu
    2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 256 - 261
  • [5] Dataflow-based Joint Quantization for Deep Neural Networks
    Geng, Xue
    Fu, Jie
    Zhao, Bin
    Lin, Jie
    Aly, Mohamed M. Sabry
    Pal, Christopher
    Chandrasekhar, Vijay
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 574 - 574
  • [6] Feature Extraction for Deep Neural Networks Based on Decision Boundaries
    Woo, Seongyoun
    Lee, Chulhee
    PATTERN RECOGNITION AND TRACKING XXVIII, 2017, 10203
  • [7] Triplet Deep Hashing with Joint Supervised Loss Based on Deep Neural Networks
    Li, Mingyong
    An, Ziye
    Wei, Qinmin
    Xiang, Kaiyue
    Ma, Yan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [8] Supervised Learning Based Algorithm Selection for Deep Neural Networks
    Shi, Shaohuai
    Xu, Pengfei
    Chu, Xiamen
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 344 - 351
  • [9] Decision Boundaries of Deep Neural Networks
    Karimi, Hamid
    Derr, Tyler
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1085 - 1092
  • [10] AdaInNet: an adaptive inference engine for distributed deep neural networks offloading in IoT-FOG applications based on reinforcement learning
    Etefaghi, Amir
    Sharifian, Saeed
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (02): : 1592 - 1621