Deep Reinforcement Learning Based Resource Management for DNN Inference in Industrial IoT

被引:67
|
作者
Zhang, Weiting [1 ]
Yang, Dong [1 ]
Haixia, Peng [2 ]
Wu, Wen [2 ]
Quan, Wei [1 ]
Zhang, Hongke [1 ]
Shen, Xuemin [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 0B5, Canada
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Task analysis; Industrial Internet of Things; Computational modeling; Servers; Resource management; Cloud computing; Training; DNN inference; industrial IoT; resource management; deep reinforcement learning; ALLOCATION; INTELLIGENCE; NETWORKING; INTERNET; 5G;
D O I
10.1109/TVT.2021.3068255
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Performing deep neural network (DNN) inference in real time requires excessive network resources, which poses a big challenge to the resource-limited industrial Internet of things (IIoT) networks. To address the challenge, in this paper, we introduce an end-edge-cloud orchestration architecture, in which the inference task assignment and DNN model placement are flexibly coordinated. Specifically, the DNN models, trained and pre-stored in the cloud, are properly placed at the end and edge to perform DNN inference. To achieve efficient DNN inference, a multi-dimensional resource management problem is formulated to maximize the average inference accuracy while satisfying the strict delay requirements of inference tasks. Due to the mix-integer decision variables, it is difficult to solve the formulated problem directly. Thus, we transform the formulated problem into a Markov decision process which can be solved efficiently. Furthermore, a deep reinforcement learning based resource management scheme is proposed to make real-time optimal resource allocation decisions. Simulation results are provided to demonstrate that the proposed scheme can efficiently allocate the available spectrum, caching, and computing resources, and improve average inference accuracy by 31.4% compared with the deep deterministic policy gradient benchmark.
引用
收藏
页码:7605 / 7618
页数:14
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Based Resource Management for DNN Inference in IIoT
    Zhang, Weiting
    Yang, Dong
    Peng, Haixia
    Wu, Wen
    Quan, Wei
    Zhang, Hongke
    Shen, Xuemin
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [2] Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
    Wu, Wen
    Yang, Peng
    Zhang, Weiting
    Zhou, Conghao
    Shen, Xuemin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) : 4988 - 4998
  • [3] Deep-Reinforcement-Learning-Based Predictive Maintenance Model for Effective Resource Management in Industrial IoT
    Ong, Kevin Shen Hoong
    Wang, Wenbo
    Niyato, Dusit
    Friedrichs, Thomas
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (07) : 5173 - 5188
  • [4] Deep Reinforcement Learning Based Resource Management in UAV-Assisted IoT Networks
    Munaye, Yirga Yayeh
    Juang, Rong-Terng
    Lin, Hsin-Piao
    Tarekegn, Getaneh Berie
    Lin, Ding-Bing
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 20
  • [5] AoI-Aware Resource Scheduling for Industrial IoT with Deep Reinforcement Learning
    Li, Hongzhi
    Tang, Lin
    Chen, Shengwei
    Zheng, Libin
    Zhong, Shaohong
    [J]. ELECTRONICS, 2024, 13 (06)
  • [6] Joint DNN Partition Deployment and Resource Allocation for Delay-Sensitive Deep Learning Inference in IoT
    He, Wenchen
    Guo, Shaoyong
    Guo, Song
    Qiu, Xuesong
    Qi, Feng
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10): : 9241 - 9254
  • [7] Deep-Reinforcement-Learning-Based Spectrum Resource Management for Industrial Internet of Things
    Shi, Zhaoyuan
    Xie, Xianzhong
    Lu, Huabing
    Yang, Helin
    Kadoch, Michel
    Cheriet, Mohamed
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3476 - 3489
  • [8] Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing
    Xiong, Xiong
    Zheng, Kan
    Lei, Lei
    Hou, Lu
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (06) : 1133 - 1146
  • [9] Resource Management with Deep Reinforcement Learning
    Mao, Hongzi
    Alizadeh, Mohammad
    Menache, Ishai
    Kandula, Srikanth
    [J]. PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 50 - 56
  • [10] Deep Reinforcement Learning-Based Task Offloading and Resource Allocation for Industrial IoT in MEC Federation System
    Do, Huong Mai
    Tran, Tuan Phong
    Yoo, Myungsik
    [J]. IEEE ACCESS, 2023, 11 : 83150 - 83170