Context-Aware Deep Model Compression for Edge Cloud Computing

Cited by: 13
Authors
Wang, Lingdong [1]
Xiang, Liyao [1]
Xu, Jiayu [1]
Chen, Jiaju [1]
Zhao, Xing [1]
Yao, Dixi [1]
Wang, Xinbing [1]
Li, Baochun [2]
Affiliations
[1] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai, Peoples R China
[2] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Source
2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS) | 2020
Funding
National Natural Science Foundation of China
Keywords
Edge Cloud Computing; Neural Architecture Search; Reinforcement Learning
DOI
10.1109/ICDCS47774.2020.00101
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
While deep neural networks (DNNs) have led to a paradigm shift, their exorbitant computational requirements have always been a roadblock to deployment at the edge, on devices such as wearables and smartphones. Hence, hybrid edge-cloud computational frameworks have been proposed that transfer part of the computation to the cloud by naively partitioning the DNN operations under the assumption of constant network conditions. However, real-world network state varies greatly depending on the context, and DNN partitioning offers only a limited strategy space. In this paper, we explore the structural flexibility of DNNs to fit the edge model to varying network contexts and different deployment platforms. Specifically, we design a reinforcement learning-based decision engine that searches for model transformation strategies under a combined objective of model accuracy and computation latency. The engine generates a context-aware model tree so that the DNN can decide which model branch to switch to at runtime. In emulation and field experiments, our approach achieves a 30%-50% latency reduction while retaining model accuracy.
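The runtime switching described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `Branch` fields, the three example branches and their numbers, the linear accuracy-minus-latency score, and the `latency_weight` value are all hypothetical. It also uses a flat list of candidates rather than a tree, and it does not perform the reinforcement learning search itself; it only shows how a device might pick among already-generated branches once it has measured the current bandwidth.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Branch:
    """One candidate model variant (all fields are hypothetical)."""
    name: str
    accuracy: float   # validation accuracy in [0, 1]
    edge_ms: float    # compute time on the edge device, in milliseconds
    upload_kb: float  # intermediate data sent to the cloud, in kilobytes
    cloud_ms: float   # compute time in the cloud, in milliseconds


def latency_ms(branch: Branch, bandwidth_kbps: float) -> float:
    """End-to-end latency: edge compute + upload + cloud compute."""
    # kilobytes -> kilobits, divided by kbit/s gives seconds, then -> ms
    transfer_ms = branch.upload_kb * 8.0 / bandwidth_kbps * 1000.0
    return branch.edge_ms + transfer_ms + branch.cloud_ms


def score(branch: Branch, bandwidth_kbps: float, latency_weight: float) -> float:
    """Assumed combined objective: accuracy minus weighted latency."""
    return branch.accuracy - latency_weight * latency_ms(branch, bandwidth_kbps)


def pick_branch(branches, bandwidth_kbps: float, latency_weight: float = 0.001) -> Branch:
    """Return the branch with the best combined score for this bandwidth."""
    return max(branches, key=lambda b: score(b, bandwidth_kbps, latency_weight))


# Made-up candidates: offload nearly everything, split with a compressed
# edge part, or run a small model entirely on the device.
BRANCHES = [
    Branch("full_offload", accuracy=0.93, edge_ms=5.0, upload_kb=150.0, cloud_ms=10.0),
    Branch("compressed_split", accuracy=0.91, edge_ms=40.0, upload_kb=20.0, cloud_ms=8.0),
    Branch("edge_only", accuracy=0.88, edge_ms=90.0, upload_kb=0.0, cloud_ms=0.0),
]

if __name__ == "__main__":
    for kbps in (100_000.0, 10_000.0, 500.0):
        chosen = pick_branch(BRANCHES, kbps)
        print(f"{kbps:>9.0f} kbit/s -> {chosen.name}")
```

With these made-up numbers, a fast link favors offloading, a moderate link favors the compressed split, and a slow link favors running entirely on the device. The paper's actual objective and branch structure may differ.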
Pages: 787 - 797
Number of pages: 11
Related papers
50 in total
  • [21] Framework for context-aware computation offloading in mobile cloud computing
    Xing Chen
    Shihong Chen
    Xuee Zeng
    Xianghan Zheng
    Ying Zhang
    Chunming Rong
    Journal of Cloud Computing, 6
  • [22] Context-aware Platform for User Authentication in Cloud Database Computing
    Kim, Manjea
    Jeong, Hoon
    Choi, Euiin
    2012 INTERNATIONAL CONFERENCE ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE & ENGINEERING (FITMSE 2012), 2012, 14 : 170 - 176
  • [23] On Context-Aware Proxy in Mobile Cloud Computing for Emergency Services
    Sneps-Sneppe, Manfred
    Namiot, Dmitry
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS (ICT 2017), 2017,
  • [24] Framework for Context-aware Computation Offloading in Mobile Cloud Computing
    Liu, Zhanghui
    Zeng, Xuee
    Huang, Wensi
    Lin, Junxin
    Chen, Xing
    Guo, Wenzhong
    2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 172 - 177
  • [25] A Context-Aware Edge Computing Framework for Smart Internet of Things
    Ben Sada, Abdelkarim
    Naouri, Abdenacer
    Khelloufi, Amar
    Dhelim, Sahraoui
    Ning, Huansheng
    FUTURE INTERNET, 2023, 15 (05)
  • [26] Framework for context-aware computation offloading in mobile cloud computing
    Chen, Xing
    Chen, Shihong
    Zeng, Xuee
    Zheng, Xianghan
    Zhang, Ying
    Rong, Chunming
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2017, 6
  • [27] Context-Aware Data and Task Placement in Edge Computing Environments
    Breitbach, Martin
    Schaefer, Dominik
    Edinger, Janick
    Becker, Christian
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), 2019,
  • [28] Toward a Multidisciplinary model of context to support context-aware computing
    Bradley, NA
    Dunlop, MD
    HUMAN-COMPUTER INTERACTION, 2005, 20 (04): : 403 - 446
  • [29] Context-Aware Image Compression
    Chan, Jacky C. K.
    Mahjoubfar, Ata
    Chen, Claire L.
    Jalali, Bahram
    PLOS ONE, 2016, 11 (07):
  • [30] Context-aware pervasive computing
    Abowd, GD
    Ebling, MR
    Gellersen, HW
    Hunt, G
    Lei, H
    IEEE WIRELESS COMMUNICATIONS, 2002, 9 (05): : 8 - 9