Distributed Deep Learning Inference Acceleration using Seamless Collaboration in Edge Computing

被引:4
|
作者
Li, Nan [1 ]
Losifidis, Alexandros [1 ]
Zhang, Qi [1 ]
机构
[1] Aarhus Univ, Dept Elect & Comp Engn, DIGIT, Aarhus, Denmark
来源
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022) | 2022年
关键词
Distributed CNNs; Receptive-field; Edge computing; Inference acceleration; Service reliability; Delay constraint;
D O I
10.1109/ICC45855.2022.9839083
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper studies inference acceleration using distributed convolutional neural networks (CNNs) in collaborative edge computing. To ensure inference accuracy in inference task partitioning, we consider the receptive-field when performing segment-based partitioning. To maximize the parallelization between the communication and computing processes, thereby minimizing the total inference time of an inference task, we design a novel task collaboration scheme in which the overlapping zone of the sub-tasks on secondary edge servers (ESs) is executed on the host ES, named as HALP. We further extend HALP to the scenario of multiple tasks. Experimental results show that HALP can accelerate CNN inference in VGG-16 by 1.7-2.0x for a single task and 1.7-1.8x for 4 tasks per batch on GTX 1080TI and JETSON AGX Xavier, which outperforms the state-of-the-art work MoDNN. Moreover, we evaluate the service reliability under time-variant channel, which shows that HALP is an effective solution to ensure high service reliability with strict service deadline.
引用
收藏
页码:3667 / 3672
页数:6
相关论文
共 50 条
  • [1] ADDA: Adaptive Distributed DNN Inference Acceleration in Edge Computing Environment
    Wang, Huitian
    Cai, Guangxing
    Huang, Zhaowu
    Dong, Fang
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 438 - 445
  • [2] Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
    Liu, Zhihong
    Xu, Xin
    Qiao, Peng
    Li, Dongsheng
    ACM COMPUTING SURVEYS, 2025, 57 (04)
  • [3] Distributed Deep Learning in An Edge Computing System
    Sen, Tanmoy
    Shen, Haiying
    Mehrab, Zakaria
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 645 - 653
  • [4] Distributed Inference Models and Algorithms for Heterogeneous Edge Systems Using Deep Learning
    Yuan, Qingqing
    Li, Zhihua
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [5] Collaborative edge computing for distributed CNN inference acceleration using receptive field-based segmentation
    Li, Nan
    Iosifidis, Alexandros
    Zhang, Qi
    COMPUTER NETWORKS, 2022, 214
  • [6] Automated Ensemble for Deep Learning Inference on Edge Computing Platforms
    Bai, Yang
    Chen, Lixing
    Abdel-Mottaleb, Mohamed
    Xu, Jie
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (06): : 4202 - 4213
  • [7] Transformer Inference Acceleration in Edge Computing Environment
    Li, Mingchu
    Zhang, Wenteng
    Xia, Dexin
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 104 - 109
  • [8] Serving distributed inference deep learning models in serverless computing
    Mahajan, Kunal
    Desai, Rumit
    2022 IEEE 15TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2022), 2022, : 109 - 111
  • [9] DEEP LEARNING AMR MODEL INFERENCE ACCELERATION WITH CFU FOR EDGE SYSTEMS
    Hilei, Pavlo
    Petruk, Marian
    Korotkyi, Ievgen
    Farenyuk, Oleg
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 66 - 70
  • [10] Distributed Training for Deep Learning Models On An Edge Computing Network Using Shielded Reinforcement Learning
    Sen, Tanmoy
    Shen, Haiying
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 581 - 591