Joint Exit Selection and Offloading Decision for Applications Based on Deep Neural Networks

被引:0
|
作者
Narmeen, Ramsha [1 ]
Mach, Pavel [1 ]
Becvar, Zdenek [1 ]
Ahmad, Ishtiaq [1 ]
机构
[1] Czech Tech Univ, Fac Elect Engn, Prague 16627, Czech Republic
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 23期
关键词
Delays; Task analysis; Servers; Artificial neural networks; Computer architecture; Accuracy; Energy consumption; Deep neural network (DNN); delay; edge computing; energy; exit selection; offloading; RESOURCE-ALLOCATION; POWER-CONTROL; EDGE; INTERNET;
D O I
10.1109/JIOT.2024.3444898
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
User applications based on the deep neural networks (DNNs), such as object or anomaly detection, image recognition, or language processing, running on computation- and energy-constrained user equipment (UE) can be partially or fully processed in the edge computing servers to reduce a processing time and save an energy in the UE. To further reduce the processing time and the UE's energy consumption, DNN with multiple exit points can be incorporated. In this article, we address the problem of the decision on whether the computation should be offloaded from the UE to the edge computing server or processed locally by the UE and we solve this problem jointly and "on-the-fly" together with DNN exit selection. Since the formulated problem is very complex, we exploit the deep deterministic policy gradient for the exit selection and the offloading decisions (labeled DDPG-EOD) for the DNN-based applications. To this end, we first convert the problem into the Markov decision process, and then, we employ an end-to-end learning via DDPG with the actor-critic architecture. Second, we use a knowledge distillation-based technique to efficiently select the DNN's exit to minimize the delay and energy consumption. Simulation results show that the proposal is highly scalable, converges very quickly, and surpasses the best performing state-of-the-art approach by up to 120% and 100% in terms of the overall DNN processing delay and the energy consumption, respectively.
引用
收藏
页码:38098 / 38112
页数:15
相关论文
共 50 条
  • [31] Soft Sensors Based on Deep Neural Networks for Applications in Security and Safety
    Xibilia, Maria Gabriella
    Latino, Mariangela
    Marinkovic, Zlatica
    Atanaskovic, Aleksandar
    Donato, Nicola
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2020, 69 (10) : 7869 - 7876
  • [32] Soft sensor development and applications based on LSTM in deep neural networks
    Ke, Wensi
    Huang, Dexian
    Yang, Fan
    Jiang, Yongheng
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3468 - 3473
  • [33] Genetic and deep learning clusters based on neural networks for management decision structures
    Will Serrano
    Neural Computing and Applications, 2020, 32 : 4187 - 4211
  • [34] Infrared Object Classification Using Decision Tree Based Deep Neural Networks
    Gundogdu, Erhan
    Koc, Aykut
    Alatan, A. Aydin
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1913 - 1916
  • [35] Genetic and deep learning clusters based on neural networks for management decision structures
    Serrano, Will
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09): : 4187 - 4211
  • [36] Breast Cancer Diagnosis based on Joint Variable Selection and Constructive Deep Neural Network
    Zemouri, R.
    Omri, N.
    Devalland, C.
    Arnould, L.
    Morello, B.
    Zerhouni, N.
    Fnaiech, E.
    2018 IEEE 4TH MIDDLE EAST CONFERENCE ON BIOMEDICAL ENGINEERING (MECBME), 2018, : 159 - 164
  • [37] Deep reinforcement learning-based joint task offloading and resource allocation in multipath transmission vehicular networks
    Yin, Chenyang
    Zhang, Yuyang
    Dong, Ping
    Zhang, Hongke
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (01)
  • [38] Privacy-Preserving Computation Offloading for Parallel Deep Neural Networks Training
    Mao, Yunlong
    Hong, Wenbo
    Wang, Heng
    Li, Qun
    Zhong, Sheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (07) : 1777 - 1788
  • [39] Adaptive joint compression method for deep neural networks
    Yao B.
    Peng X.
    Yu X.
    Liu L.
    Peng Y.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (05): : 21 - 32
  • [40] Implementation of Deep Neural Networks for Industry Applications
    Rozycki, Pawel
    Kolbusz, Janusz
    Krzos, Grzegorz
    Wilamowski, Bogdan M.
    IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 2717 - 2722