DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT

被引:26
|
作者
Fan, Wenhao [1 ,2 ]
Chen, Zeyu [1 ,2 ]
Hao, Zhibo [1 ,2 ]
Su, Yi [1 ,2 ]
Wu, Fan [1 ,2 ]
Tang, Bihua [1 ,2 ]
Liu, Yuan'an [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Work Safety Intelligent Monitoring, Beijing 100876, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Deep neural network (DNN) inference; edge computing; industrial Internet of Things (IIoT); resource management; task offloading; EDGE; IOT;
D O I
10.1109/TII.2022.3192882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Joint task inference, which fully utilizes end edge cloud cooperation, can effectively enhance the performance of deep neural network (DNN) inference services in the industrial internet of things (IIoT) applications. In this paper, we propose a novel joint resource management scheme for a multi task and multi service scenario consisting of multiple sensors, a cloud server, and a base station equipped with an edge server . A time slotted system model is proposed, incorporating DNN deployment, data size control, task offloading, computing resource allocation, and wireless channel allocation. Among them, the DNN deployment is to deploy proper DNNs on the edge server under its total resource constraint, and the data size control is to make trade off between task inference accuracy and task transmission delay through changing task da ta size. Our goal is to minimize the total cost including total task processing delay and total error inference penalty while guaranteeing long term task queue stability and all task inference accuracy requirements. Leveraging the Lyapunov optimization, we first transform the optimization problem into a deterministic problem for each time slot. Then, a deep deterministic policy gradient (DDPG) based deep reinforcement learning (DRL) algorithm is designed to provide the near optimal solution. We further desi gn a fast numerical method for the data size control sub problem to reduce the training complexity of the DRL model, and design a penalty mechanism to prevent frequent optimizations of DNN deployment. Extensive experiments are conducted by varying differen t crucial parameters. The superiority of our scheme is demonstrated in comparison with 3 other schemes.
引用
收藏
页码:1634 / 1646
页数:13
相关论文
共 50 条
  • [41] Joint Resource Allocation and Multi-Part Collaborative Task Offloading in MEC Systems
    Zhang, Hongxia
    Yang, Yongjin
    Shang, Bodong
    Zhang, Peiying
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) : 8877 - 8890
  • [42] Joint Task Offloading and Resource Allocation in Vehicular Edge Computing Networks for Emergency Logistics
    Li, Rui
    Ling, Darong
    Wang, Yisheng
    Zhao, Shuang
    Wang, Jun
    Li, Jun
    Mathematical Problems in Engineering, 2023, 2023
  • [43] Joint DNN Partition Deployment and Resource Allocation for Delay-Sensitive Deep Learning Inference in IoT
    He, Wenchen
    Guo, Shaoyong
    Guo, Song
    Qiu, Xuesong
    Qi, Feng
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10): : 9241 - 9254
  • [44] Joint Task Offloading and Resource Allocation for Multi-Task Multi-Server NOMA-MEC Networks
    Xue, Jianbin
    An, Yaning
    IEEE ACCESS, 2021, 9 : 16152 - 16163
  • [45] Joint Task Offloading, Resource Allocation, and Trajectory Design for Multi-UAV Cooperative Edge Computing With Task Priority
    Hao, Hao
    Xu, Changqiao
    Zhang, Wei
    Yang, Shujie
    Muntean, Gabriel-Miro
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (09) : 8649 - 8663
  • [46] Joint Computation Offloading and Task Caching Strategy for MEC-Enabled IIoT
    Deng, Yunfeng
    Sun, Haifeng
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 349 - 361
  • [47] Game Theory based Joint Task Offloading and Resource Allocation Algorithm for Mobile Edge Computing
    Li, Ning
    Yan, Jianen
    Zhang, Zhaoxin
    Martinez, Jose Fernan
    Yuan, Xin
    2020 16TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2020), 2020, : 791 - 796
  • [48] Multiobjective Optimization for Joint Task Offloading, Power Assignment, and Resource Allocation in Mobile Edge Computing
    Wang, Peng
    Li, Kenli
    Xiao, Bin
    Li, Keqin
    IEEE INTERNET OF THINGS JOURNAL, 2021, 9 (14) : 11737 - 11748
  • [49] Dependency-Aware Joint Task Offloading and Resource Allocation in Heterogeneous Mobile Edge Computing
    Zhang, Guo
    Zhang, Baoxian
    Peng, Shuo
    Li, Cheng
    IEEE Transactions on Wireless Communications, 2024, 23 (12) : 19444 - 19458
  • [50] Joint Task Offloading and Resource Allocation for Mobile Edge Computing in Ultra-Dense Network
    Cheng, Zhipeng
    Min, Minghui
    Gao, Zhibin
    Huang, Lianfen
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,