Distributed Deep Learning Inference Acceleration using Seamless Collaboration in Edge Computing

被引：4

作者：

Li, Nan ^{[1
]}

Losifidis, Alexandros ^{[1
]}

Zhang, Qi ^{[1
]}

机构：

[1] Aarhus Univ, Dept Elect & Comp Engn, DIGIT, Aarhus, Denmark

来源：

IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022) | 2022年

关键词：

Distributed CNNs; Receptive-field; Edge computing; Inference acceleration; Service reliability; Delay constraint;

D O I：

10.1109/ICC45855.2022.9839083

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

This paper studies inference acceleration using distributed convolutional neural networks (CNNs) in collaborative edge computing. To ensure inference accuracy in inference task partitioning, we consider the receptive-field when performing segment-based partitioning. To maximize the parallelization between the communication and computing processes, thereby minimizing the total inference time of an inference task, we design a novel task collaboration scheme in which the overlapping zone of the sub-tasks on secondary edge servers (ESs) is executed on the host ES, named as HALP. We further extend HALP to the scenario of multiple tasks. Experimental results show that HALP can accelerate CNN inference in VGG-16 by 1.7-2.0x for a single task and 1.7-1.8x for 4 tasks per batch on GTX 1080TI and JETSON AGX Xavier, which outperforms the state-of-the-art work MoDNN. Moreover, we evaluate the service reliability under time-variant channel, which shows that HALP is an effective solution to ensure high service reliability with strict service deadline.

引用

页码：3667 / 3672

页数：6

共 50 条

[1] ADDA: Adaptive Distributed DNN Inference Acceleration in Edge Computing Environment
Wang, Huitian
Cai, Guangxing
Huang, Zhaowu
Dong, Fang
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 438 - 445
[2] Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Liu, Zhihong
Xu, Xin
Qiao, Peng
Li, Dongsheng
ACM COMPUTING SURVEYS, 2025, 57 (04)
[3] Distributed Deep Learning in An Edge Computing System
Sen, Tanmoy
Shen, Haiying
Mehrab, Zakaria
2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 645 - 653
[4] Distributed Inference Models and Algorithms for Heterogeneous Edge Systems Using Deep Learning
Yuan, Qingqing
Li, Zhihua
APPLIED SCIENCES-BASEL, 2025, 15 (03):
[5] Collaborative edge computing for distributed CNN inference acceleration using receptive field-based segmentation
Li, Nan
Iosifidis, Alexandros
Zhang, Qi
COMPUTER NETWORKS, 2022, 214
[6] Automated Ensemble for Deep Learning Inference on Edge Computing Platforms
Bai, Yang
Chen, Lixing
Abdel-Mottaleb, Mohamed
Xu, Jie
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (06): : 4202 - 4213
[7] Transformer Inference Acceleration in Edge Computing Environment
Li, Mingchu
Zhang, Wenteng
Xia, Dexin
2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 104 - 109
[8] Serving distributed inference deep learning models in serverless computing
Mahajan, Kunal
Desai, Rumit
2022 IEEE 15TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2022), 2022, : 109 - 111
[9] DEEP LEARNING AMR MODEL INFERENCE ACCELERATION WITH CFU FOR EDGE SYSTEMS
Hilei, Pavlo
Petruk, Marian
Korotkyi, Ievgen
Farenyuk, Oleg
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 66 - 70
[10] Distributed Training for Deep Learning Models On An Edge Computing Network Using Shielded Reinforcement Learning
Sen, Tanmoy
Shen, Haiying
2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 581 - 591

← 1 2 3 4 5 →