Context-Aware Deep Model Compression for Edge Cloud Computing

被引:13
|
作者
Wang, Lingdong [1 ]
Xiang, Liyao [1 ]
Xu, Jiayu [1 ]
Chen, Jiaju [1 ]
Zhao, Xing [1 ]
Yao, Dixi [1 ]
Wang, Xinbing [1 ]
Li, Baochun [2 ]
机构
[1] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai, Peoples R China
[2] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
Edge Cloud Computing; Neural Architecture Search; Reinforcement Learning;
D O I
10.1109/ICDCS47774.2020.00101
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While deep neural networks (DNNs) have led to a paradigm shift, its exorbitant computational requirement has always been a roadblock in its deployment to the edge, such as wearable devices and smartphones. Hence a hybrid edge-cloud computational framework is proposed to transfer part of the computation to the cloud, by naively partitioning the DNN operations under the constant network condition assumption. However, real-world network state varies greatly depending on the context, and DNN partitioning only has limited strategy space. In this paper, we explore the structural flexibility of DNN to fit the edge model to varying network contexts and different deployment platforms. Specifically, we designed a reinforcement learning-based decision engine to search for model transformation strategies in response to a combined objective of model accuracy and computation latency. The engine generates a context-aware model tree so that the DNN can decide the model branch to switch to at runtime. By the emulation and field experimental results, our approach enjoys a 30% - 50% latency reduction while retaining the model accuracy.
引用
收藏
页码:787 / 797
页数:11
相关论文
共 50 条
  • [1] Context-Aware Access Control Model for Cloud Computing
    Zhou, Zhenji
    Wu, Lifa
    Hong, Zheng
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2013, 6 (06): : 1 - 12
  • [2] Context-Aware Verifiable Cloud Computing
    Yan, Zheng
    Yu, Xixun
    Ding, Wenxiu
    IEEE ACCESS, 2017, 5 : 2211 - 2227
  • [3] Context-Aware Cloud Service Selection Model for Mobile Cloud Computing Environments
    Wu, Xu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [4] Context-aware information broker for cloud computing
    Klančnik, Tomaz
    Blazič, Borka Jerman
    International Review on Computers and Software, 2010, 5 (01) : 52 - 58
  • [5] Toward Context-Aware SLA for Cloud Computing
    Labidi, Taher
    Mtibaa, Achraf
    Gaaloul, Walid
    Gargouri, Faiez
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 350 - 359
  • [6] Context-aware Distributed Storage in Mobile Cloud Computing
    Han, Dong
    Yan, Ye
    Shu, Tao
    2015 IEEE 12th International Conference on Mobile Ad Hoc and Sensor Systems (MASS), 2015, : 460 - 461
  • [7] A Context-Aware Authentication System for Mobile Cloud Computing
    Benzekki, Kamal
    El Fergougui, Abdeslam
    ElBelrhiti ElAlaoui, Abdelbaki
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 : 379 - 387
  • [8] Mobility and Context-Aware Offloading in Mobile Cloud Computing
    Roostaei, Razie
    Movahedi, Zeinab
    2016 INT IEEE CONFERENCES ON UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING AND COMMUNICATIONS, CLOUD AND BIG DATA COMPUTING, INTERNET OF PEOPLE, AND SMART WORLD CONGRESS (UIC/ATC/SCALCOM/CBDCOM/IOP/SMARTWORLD), 2016, : 1144 - 1148
  • [9] Context-Aware Service Modes in the Cloud Computing Environment
    Pan Yu
    Luo Lijuan
    Gao Li
    Lv Tingjie
    CHINA COMMUNICATIONS, 2012, 9 (02) : 86 - 95
  • [10] Context-aware Job Scheduling for Cloud Computing Environments
    Assuncao, Marcos D.
    Netto, Marco A. S.
    Koch, Fernando
    Bianchi, Silvia
    2012 IEEE/ACM FIFTH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2012), 2012, : 255 - 262