Edge intelligence in motion: Mobility-aware dynamic DNN inference service migration with downtime in mobile edge computing

被引:0
|
作者
Wang, Pu [1 ,2 ]
Ouyang, Tao [1 ]
Liao, Guocheng [3 ]
Gong, Jie [1 ]
Yu, Shuai [1 ]
Chen, Xu [1 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Sun Yat Sen Univ, Sch Biomed Engn, Guangzhou, Peoples R China
[3] Sun Yat Sen Univ, Sch Software Engn, Zhuhai, Peoples R China
基金
美国国家科学基金会;
关键词
DNN service migration; Multi-exit DNN; Service downtime; Mobile edge computing; Model predictive control; PLACEMENT;
D O I
10.1016/j.sysarc.2022.102664
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Edge intelligence (EI) becomes a trend to push the deep learning frontiers to the network edge, so that deep neural networks (DNNs) applications can be well leveraged at resource-constrained mobile devices with benefits of edge computing. Due to the high user mobility among scattered edge servers in many scenarios such as internet of vehicular applications, dynamic service migration is desired to maintain a reliable and efficient quality of service (QoS). However, inevitable service downtime incurred by service migration would largely degrade the real-time performance of delay-sensitive DNN inference services. To address this issue, we advocate a user-centric management for dynamic DNN inference service migration with flexible multi-exit mechanism, aiming at maximizing overall user utility (e.g., DNN model inference accuracy) with various service downtime. We first leverage dynamic programming to propose an optimal offline migration and exit point selection strategy (OMEPS) algorithm when complete future information of user behaviors is available. Amenable to a more practical application domain without complete future information, we incorporate the OMEPS algorithm into a model predictive control (MPC) framework, then construct a mobility-aware service migration and DNN exit point selection (MOMEPS) algorithm, which improves the long-term user utility within limited predictive future information. However, heavy computation overheads of MOMEPS algorithm impose burdens on mobile devices, thus we further advocate a cost-efficient algorithm, named smart-MOMEPS, which introduces a smart migration judgement based on Neural Networks to control the implementation of (MOMEPS) algorithm by wisely estimating whether the DNN service should be migrated or not. Extensive trace-driven simulation results demonstrate the superior performance of our smart-MOMEPS algorithm for achieving significant overall utility improvements with low computation overheads compared with other online algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Edge intelligence in motion: Mobility-aware dynamic DNN inference service migration with downtime in mobile edge computing
    Wang, Pu
    Ouyang, Tao
    Liao, Guocheng
    Gong, Jie
    Yu, Shuai
    Chen, Xu
    [J]. Journal of Systems Architecture, 2022, 130
  • [2] Follow Me at the Edge: Mobility-Aware Dynamic Service Placement for Mobile Edge Computing
    Ouyang, Tao
    Zhou, Zhi
    Chen, Xu
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (10) : 2333 - 2345
  • [3] Follow Me at the Edge: Mobility-Aware Dynamic Service Placement for Mobile Edge Computing
    Tao Ouyang
    Zhi Zhou
    Xu Chen
    [J]. 2018 IEEE/ACM 26TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2018,
  • [4] Mobility-aware personalized service recommendation in mobile edge computing
    Hongxia Zhang
    Yanhui Dong
    Yongjin Yang
    [J]. EURASIP Journal on Wireless Communications and Networking, 2021
  • [5] Mobility-aware personalized service recommendation in mobile edge computing
    Zhang, Hongxia
    Dong, Yanhui
    Yang, Yongjin
    [J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [6] Mobility-Aware Service Selection in Mobile Edge Computing Systems
    Wu, Hongyue
    Deng, Shuiguang
    Li, Wei
    Yin, Jianwei
    Li, Xiaohong
    Feng, Zhiyong
    Zomaya, Albert
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, : 201 - 208
  • [7] Digital twin-assisted and mobility-aware service migration in Mobile Edge Computing
    Bozkaya, Elif
    [J]. COMPUTER NETWORKS, 2023, 231
  • [8] Mobility-aware edge server placement for mobile edge computing*
    Chen, Yuanyi
    Wang, Dezhi
    Wu, Nailong
    Xiang, Zhengzhe
    [J]. COMPUTER COMMUNICATIONS, 2023, 208 : 136 - 146
  • [9] PDMA: Probabilistic service migration approach for delay-aware and mobility-aware mobile edge computing
    Xu, Minxian
    Zhou, Qiheng
    Wu, Huaming
    Lin, Weiwei
    Ye, Kejiang
    Xu, Chengzhong
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (02): : 394 - 414
  • [10] Mobility-Aware Service Caching in Mobile Edge Computing for Internet of Things
    Wei, Hua
    Luo, Hong
    Sun, Yan
    [J]. SENSORS, 2020, 20 (03)