To cloud or not to cloud: an on-line scheduler for dynamic privacy-protection of deep learning workload on edge devices

被引:0
|
作者
Yibin Tang
Ying Wang
Huawei Li
Xiaowei Li
机构
[1] Chinese Academy of Sciences,State Key Laboratory of Computer Architecture, Institute of Computing Technology
[2] University of Chinese Academy of Sciences,undefined
[3] Peng Cheng Laboratory,undefined
[4] Wuhan Digital Engineering Institute,undefined
关键词
Real-time; Deep learning; Edge computing; Privacy protection;
D O I
暂无
中图分类号
学科分类号
摘要
Recently deep learning applications are thriving on edge and mobile computing scenarios, due to the concerns of latency constraints, data security and privacy, and other considerations. However, because of the limitation of power delivery, battery lifetime and computation resource, offering real-time neural network inference ability has to resort to the specialized energy-efficient architecture, and sometimes the coordination between the edge devices and the powerful cloud or fog facilities. This work investigates a realistic scenario when an on-line scheduler is needed to meet the requirement of latency even when the edge computing resources and communication speed are dynamically fluctuating, while protecting the privacy of users as well. It also leverages the approximate computing feature of neural networks and actively trade-off excessive neural network propagation paths for latency guarantee even when local resource provision is unstable. Combining neural network approximation and dynamic scheduling, the real-time deep learning system could adapt to different requirements of latency/accuracy and the resource fluctuation of mobile-cloud applications. Experimental results also demonstrate that the proposed scheduler significantly improves the energy efficiency of real-time neural networks on edge devices.
引用
下载
收藏
页码:85 / 100
页数:15
相关论文
共 50 条
  • [41] Distributed Deep Neural Network Deployment for Smart Devices from the Edge to the Cloud
    Lin, Chang-You
    Wang, Tzu-Chen
    Chen, Kuan-Chih
    Lee, Bor-Yan
    Kuo, Jian-Jhih
    PROCEEDINGS OF THE 2019 ACM MOBIHOCWORKSHOP ON PERVASIVE SYSTEMS IN THE IOT ERA (PERSIST-IOT '19), 2019, : 43 - 48
  • [42] Research of Lightweight Cloud Edge Collaboration Framework Based on Edge Agent and Deep Learning
    Li X.
    Ren Y.
    Jin T.
    Pei C.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2023, 52 (05): : 756 - 764
  • [43] CoEdge: Exploiting the Edge-Cloud Collaboration for Faster Deep Learning
    Hu, Liangyan
    Sun, Guodong
    Ren, Yanlong
    IEEE ACCESS, 2020, 8 : 100533 - 100541
  • [44] A Deep Learning Image Recognition Method Based on Edge Cloud Computing
    Wei, Rui
    Engineering Intelligent Systems, 2023, 31 (01): : 5 - 12
  • [45] A Dynamic Service Adaptation Algorithm for Seamless Integration of Cloud Infrastructure and Edge Devices
    Yang, Liu
    Li, Yi
    SERVICE-ORIENTED COMPUTING, ICSOC 2018, 2019, 11434 : 182 - 193
  • [46] Deep Reinforcement Learning for Dynamic Workflow Scheduling in Cloud Environment
    Dong, Tingting
    Xue, Fei
    Xiao, Changbai
    Zhang, Jiangjiang
    2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 107 - 115
  • [47] A Computing Resource Allocation Optimization Strategy for Massive Internet of Health Things Devices Considering Privacy Protection in Cloud Edge Computing Environment
    Jianxi Wang
    Liutao Wang
    Journal of Grid Computing, 2021, 19
  • [48] A Computing Resource Allocation Optimization Strategy for Massive Internet of Health Things Devices Considering Privacy Protection in Cloud Edge Computing Environment
    Wang, Jianxi
    Wang, Liutao
    JOURNAL OF GRID COMPUTING, 2021, 19 (02)
  • [49] Protecting Inference Privacy With Accuracy Improvement in Mobile-Cloud Deep Learning
    Wang, Shulan
    Liu, Qin
    Xu, Yang
    Jiang, Hongbo
    Wu, Jie
    Wang, Tian
    Peng, Tao
    Wang, Guojun
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 6522 - 6537
  • [50] A SELF-ADAPTIVE DEEP LEARNING-BASED MODEL TO PREDICT CLOUD WORKLOAD
    Borna, K.
    Ghanbari, R.
    NEURAL NETWORK WORLD, 2023, 33 (03) : 161 - 169