Energy-Efficient Deep Learning Task Scheduling Strategy for Edge Device

Authors
Ren J. [1]
Gao L. [2, 3]
Yu J.-L. [2]
Yuan L. [2]
Affiliations
[1] School of Computer Science, Shaanxi Normal University, Xi'an
[2] School of Information and Technology, Northwest University, Xi'an
[3] School of Computer Science, Xi'an Polytechnic University, Xi'an
Funding
National Natural Science Foundation of China
Keywords
Deep learning model; Edge devices; Energy efficiency; Task scheduling strategy
DOI
10.11897/SP.J.1016.2020.00440
Abstract
Deep neural networks have made significant progress in many fields. Their strong computational capability makes them an efficient tool for solving complex problems, and they are widely used in autonomous driving, face recognition, and augmented reality. Given the outstanding performance of deep learning in image recognition and natural language processing, applying deep learning models in mobile applications is inevitable. Typically, a deep learning model relies on high-performance servers equipped with powerful processors and large storage. However, because mobile networks are unstable and bandwidth is limited, running deep learning on a cloud server may cause response delays that degrade the user experience, and running inference in the cloud also raises privacy concerns. Meanwhile, researchers have tried to execute inference on the user's own device, mainly through model compression techniques and lightweight deep models, but all of these approaches sacrifice model accuracy. Because the resources of a mobile terminal are limited (computing power, storage size, and battery capacity), the mobile device alone cannot meet the demands of a DNN model. A new computing paradigm is needed so that Deep Neural Network (DNN) based models can meet users' expectations for fast response, low energy consumption, and high accuracy. This paper proposes a novel scheduling strategy, the Edge-based strategy, for deep learning inference tasks using edge devices. The Edge-based strategy combines the mobility of the user's mobile device with the powerful processors of the edge server. First, the strategy selects and deploys appropriate DNN models by considering inference time and accuracy.
Specifically, the Edge-based strategy evaluates the candidate deep models on the user's mobile device and records the inference time and the misclassified samples of each. Inference time is the first priority on the mobile device, so the strategy deploys the model with the lowest inference time there. It then feeds the samples that this model misclassified to the other deep models, selects the one with the highest accuracy, and deploys it on the edge device. After deploying the models on both devices, the Edge-based strategy focuses on how to schedule inference tasks between the two devices to achieve the best performance. The core of task scheduling is a pre-trained classification model that takes into account the complexity of the input data and the user's expectations, and schedules inference tasks dynamically. This paper compares four typical machine learning techniques for training the classification model; random forest gives the highest accuracy. Taking an image recognition application as an example, the paper evaluates 12 popular CNN models on a Raspberry Pi 3B+ and a Jetson TX2. The experimental results show that, in a mobile network environment, the Edge-based strategy effectively improves the performance of the deep models and reduces the overhead of inference. Compared with the single model that has the highest accuracy, the approach reduces energy consumption by 93.2% and inference time by 91.6%, and improves accuracy by 3.88%.
© 2020, Science Press. All rights reserved.
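The two-step model selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `ModelProfile` type, the `select_models` function, the model names, and all timings and sample ids below are hypothetical, and each model's result is reduced to the set of sample ids it classifies correctly.

```python
from dataclasses import dataclass


@dataclass
class ModelProfile:
    """Hypothetical profiling record for one candidate model."""
    name: str
    inference_time_ms: float  # assumed to be measured on the mobile device
    correct: set              # ids of samples the model classifies correctly


def select_models(profiles, sample_ids):
    """Pick a mobile model and an edge model following the abstract's two steps."""
    # Step 1: inference time is the first priority on the mobile device.
    mobile = min(profiles, key=lambda p: p.inference_time_ms)
    # Samples the mobile model gets wrong.
    failed = set(sample_ids) - mobile.correct
    # Step 2: among the remaining models, pick the one that classifies
    # the most failed samples correctly (highest accuracy on that subset).
    # Note: max() raises ValueError if there is only one candidate model.
    others = [p for p in profiles if p is not mobile]
    edge = max(others, key=lambda p: len(p.correct & failed))
    return mobile, edge, failed


# Toy data, invented purely for illustration.
profiles = [
    ModelProfile("model_a", 20.0, set(range(0, 6))),    # fast, least accurate
    ModelProfile("model_b", 80.0, set(range(0, 8))),
    ModelProfile("model_c", 150.0, set(range(2, 10))),  # slow, covers failures
]
mobile, edge, failed = select_models(profiles, range(10))
print(mobile.name, edge.name, sorted(failed))
```

In this toy setup the fastest model goes to the mobile device, and the edge model is the one that best covers the samples the mobile model misses, which is not necessarily the model with the best overall accuracy.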
Pages: 440-452
Page count: 12