Input-Dependent Edge-Cloud Mapping of Recurrent Neural Networks Inference

被引:7
|
作者
Pagliari, Daniele Jahier [1 ]
Chiaro, Roberta [1 ]
Chen, Yukai [1 ]
Vinco, Sara [1 ]
Macii, Enrico [2 ]
Poncino, Massimo [1 ]
机构
[1] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[2] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
来源
PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC) | 2020年
关键词
D O I
10.1109/dac18072.2020.9218595
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given the computational complexity of Recurrent Neural Networks (RNNs) inference, IoT and mobile devices typically offload this task to the cloud. However, the execution time and energy consumption of RNN inference strongly depends on the length of the processed input. Therefore, considering also communication costs, it may be more convenient to process short input sequences locally and only offload long ones to the cloud. In this paper, we propose a low-overhead runtime tool that performs this choice automatically. Results based on real edge and cloud devices show that our method is able to simultaneously reduce the total execution time and energy consumption of the system compared to solutions that run RNN inference fully locally or fully in the cloud.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Yang, Zheming
    Ji, Wen
    Wang, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [22] Attacking and Protecting Data Privacy in Edge-Cloud Collaborative Inference Systems
    He, Zecheng
    Zhang, Tianwei
    Lee, Ruby B.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9706 - 9716
  • [23] An Adaptive Task Migration Scheduling Approach for Edge-Cloud Collaborative Inference
    Zhang, Boyin
    Li, Yinggang
    Zhang, Shigeng
    Zhang, Yue
    Zhu, Bing
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [24] Energy-Aware Workload Allocation for Distributed Deep Neural Networks in Edge-Cloud Continuum
    Jin, Yi
    Xu, Jiawei
    Huan, Yuxiang
    Yan, Yulong
    Zheng, Lirong
    Zou, Zhuo
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 213 - 217
  • [25] An Energy-Efficient Method for Recurrent Neural Network Inference in Edge Cloud Computing
    Chen, Chao
    Guo, Weiyu
    Wang, Zheng
    Yang, Yongkui
    Wu, Zhuoyu
    Li, Guannan
    SYMMETRY-BASEL, 2022, 14 (12):
  • [26] JMDC: A joint model and data compression system for deep neural networks collaborative computing in edge-cloud networks
    Ding, Yi
    Fang, Weiwei
    Liu, Mengran
    Wang, Meng
    Cheng, Yusong
    Xiong, Naixue
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 173 : 83 - 93
  • [27] Optimal Placement of Recurrent Service Chains on Distributed Edge-Cloud Infrastructures
    Mahjoubi, Ayeh
    Taheri, Javid
    Grinnemo, Karl-Johan
    Deng, Shuiguang
    PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021, : 495 - 502
  • [28] Cost-Effective Service Function Chain Mapping Approaches in Edge-Cloud Elastic Optical Networks
    Yu, Jun
    Zheng, Wenwen
    Shao, Weidong
    Chen, Hong
    Zheng, Danyang
    Chen, Bowen
    Wu, Jinbing
    2022 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE, ACP, 2022, : 1160 - 1162
  • [29] Recognizing recurrent neural networks (rRNN): Bayesian inference for recurrent neural networks
    Bitzer, Sebastian
    Kiebel, Stefan J.
    BIOLOGICAL CYBERNETICS, 2012, 106 (4-5) : 201 - 217
  • [30] Recognizing recurrent neural networks (rRNN): Bayesian inference for recurrent neural networks
    Sebastian Bitzer
    Stefan J. Kiebel
    Biological Cybernetics, 2012, 106 : 201 - 217