Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

被引:35
|
作者
Shah, Hurmat Ali [1 ]
Zhao, Lian [1 ]
机构
[1] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Deep reinforcement learning (DRL); Internet of Things (IoT); machine learning (ML); network virtualization; optimization; Q-learning (QL); resource allocation;
D O I
10.1109/JIOT.2020.3022572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Resource allocation is a significant task in the emerging area of Internet of Things (IoT). IoT devices are usually low-cost devices with limited computational power and capabilities for long term communication. In this article, the network function virtualization (NFV) technique is used to access resources of the network and a reinforcement learning (RL) algorithm is used to solve the problem of resource allocation in IoT networks. The traffic of the IoT network uses the substrate network which is available through NFV for its data transmission. The data transmission needs of the IoT network are translated to virtual requests and service function chain (SFC) are mapped to the substrate network to serve the requests. The problem of SFC placement while meeting the system constraints of the IoT network is a nonconvex problem. In the proposed deep RL (DRL)-based resource allocation, the virtual layer acts as a common repository of the network resources. The optimization problem of SFC placement under the system constraints of IoT networks can be formulated as a Markovian decision process (MDP). The MDP problem is solved through a multiagent DRL algorithm where each agent serves an SFC. Two Q-networks are considered, where one Q-network solves the SFC placement problem while the other updates weights of the Q-network through keeping track of long-term policy changes. The virtual agents serving SFCs interact with the environment, receive reward collectively and update the policy by using the learned experiences. We show that the proposed scheme can solve the optimization problem of SFC placement through adequate reward design, state, and action space formulation. Simulation results demonstrate that the multiagent DRL scheme outperforms the reference schemes in terms of utility gained as measured through different network parameters.
引用
收藏
页码:3410 / 3421
页数:12
相关论文
共 50 条
  • [1] Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things
    Gu, Bo
    Zhang, Xu
    Lin, Ziqi
    Alazab, Mamoun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3066 - 3074
  • [2] Deep-Reinforcement-Learning-Based Spectrum Resource Management for Industrial Internet of Things
    Shi, Zhaoyuan
    Xie, Xianzhong
    Lu, Huabing
    Yang, Helin
    Kadoch, Michel
    Cheriet, Mohamed
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3476 - 3489
  • [3] Multiagent Deep-Reinforcement-Learning-Based Resource Allocation for Heterogeneous QoS Guarantees for Vehicular Networks
    Tian, Jie
    Liu, Qianqian
    Zhang, Haixia
    Wu, Dalei
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03): : 1683 - 1695
  • [4] Deep-Reinforcement-Learning-Based Production Scheduling in Industrial Internet of Things
    Luo, Zihui
    Jiang, Chengling
    Liu, Liang
    Zheng, Xiaolong
    Ma, Huadong
    Dong, Fang
    Li, Fucun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (22) : 19725 - 19739
  • [5] Deep-Reinforcement-Learning-Based Energy-Efficient Resource Management for Social and Cognitive Internet of Things
    Yang, Helin
    Zhong, Wen-De
    Chen, Chen
    Alphones, Arokiaswami
    Xie, Xianzhong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (06) : 5677 - 5689
  • [6] Deep-Reinforcement-Learning-Based Resource Allocation for Cloud Gaming via Edge Computing
    Deng, Xiaoheng
    Zhang, Jingjing
    Zhang, Honggang
    Jiang, Ping
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (06) : 5364 - 5377
  • [7] Multiagent Deep-Reinforcement-Learning-Based Channel Allocation for MEO-LEO Networked Telemetry System
    Zeng, Guanming
    Zhan, Yafeng
    Xiao, Xiaolong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 10817 - 10830
  • [8] Multiagent Federated Reinforcement Learning for Resource Allocation in UAV-Enabled Internet of Medical Things Networks
    Seid, Abegaz Mohammed
    Erbad, Aiman
    Abishu, Hayla Nahom
    Albaseer, Abdullatif
    Abdallah, Mohamed
    Guizani, Mohsen
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (22) : 19695 - 19711
  • [9] Deep Reinforcement Learning-Based Resource Allocation for Satellite Internet of Things with Diverse QoS Guarantee
    Tang, Siqi
    Pan, Zhisong
    Hu, Guyu
    Wu, Yang
    Li, Yunbo
    [J]. SENSORS, 2022, 22 (08)
  • [10] Network Resource Allocation Strategy Based on Deep Reinforcement Learning
    Zhang, Shidong
    Wang, Chao
    Zhang, Junsan
    Duan, Youxiang
    You, Xinhong
    Zhang, Peiying
    [J]. IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2020, 1 (01): : 86 - 94