Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

被引:35
|
作者
Shah, Hurmat Ali [1 ]
Zhao, Lian [1 ]
机构
[1] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Deep reinforcement learning (DRL); Internet of Things (IoT); machine learning (ML); network virtualization; optimization; Q-learning (QL); resource allocation;
D O I
10.1109/JIOT.2020.3022572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Resource allocation is a significant task in the emerging area of Internet of Things (IoT). IoT devices are usually low-cost devices with limited computational power and capabilities for long term communication. In this article, the network function virtualization (NFV) technique is used to access resources of the network and a reinforcement learning (RL) algorithm is used to solve the problem of resource allocation in IoT networks. The traffic of the IoT network uses the substrate network which is available through NFV for its data transmission. The data transmission needs of the IoT network are translated to virtual requests and service function chain (SFC) are mapped to the substrate network to serve the requests. The problem of SFC placement while meeting the system constraints of the IoT network is a nonconvex problem. In the proposed deep RL (DRL)-based resource allocation, the virtual layer acts as a common repository of the network resources. The optimization problem of SFC placement under the system constraints of IoT networks can be formulated as a Markovian decision process (MDP). The MDP problem is solved through a multiagent DRL algorithm where each agent serves an SFC. Two Q-networks are considered, where one Q-network solves the SFC placement problem while the other updates weights of the Q-network through keeping track of long-term policy changes. The virtual agents serving SFCs interact with the environment, receive reward collectively and update the policy by using the learned experiences. We show that the proposed scheme can solve the optimization problem of SFC placement through adequate reward design, state, and action space formulation. Simulation results demonstrate that the multiagent DRL scheme outperforms the reference schemes in terms of utility gained as measured through different network parameters.
引用
下载
收藏
页码:3410 / 3421
页数:12
相关论文
共 50 条
  • [21] Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Wireless Systems
    Sun, Shu
    Li, Xiaofeng
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 648 - 660
  • [22] Novel Optical Access Network Virtualization and Dynamic Resource Allocation Algorithms for the Internet of Things
    Wang, Jing
    Cvijetic, Neda
    Kanonakis, Konstantinos
    Wang, Ting
    Chang, Gee-Kung
    2015 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2015,
  • [23] Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things
    Lin, Lixia
    Zhou, Wen'an
    Yang, Zhicheng
    Liu, Jianlong
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2023, 16 (01) : 170 - 188
  • [24] Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things
    Lixia Lin
    Wen’an Zhou
    Zhicheng Yang
    Jianlong Liu
    Peer-to-Peer Networking and Applications, 2023, 16 : 170 - 188
  • [25] Multiagent Deep Reinforcement Learning for Task Offloading and Resource Allocation in Cybertwin-Based Networks
    Hou, Wenjing
    Wen, Hong
    Song, Huanhuan
    Lei, Wenxin
    Zhang, Wei
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (22) : 16256 - 16268
  • [26] Internet of Things Virtual Networks Bringing Network Virtualization to Resource-constrained Devices
    Ishaq, Isam
    Hoebeke, Jeroen
    Moerman, Ingrid
    Demeester, Piet
    2012 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS, CONFERENCE ON INTERNET OF THINGS, AND CONFERENCE ON CYBER, PHYSICAL AND SOCIAL COMPUTING (GREENCOM 2012), 2012, : 293 - 300
  • [27] Adaptive Resource Allocation Method Based on Deep Q Network for Industrial Internet of Things
    Lai, Xiaolong
    Hu, Qing
    Wang, Weixin
    Fei, Li
    Huang, Ying
    IEEE ACCESS, 2020, 8 : 27426 - 27434
  • [28] Toward Deep Q-Network-Based Resource Allocation in Industrial Internet of Things
    Liang, Fan
    Yu, Wei
    Liu, Xing
    Griffith, David
    Golmie, Nada
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 9138 - 9150
  • [29] Deep-Reinforcement-Learning-Based Mode Selection and Resource Allocation for Cellular V2X Communications
    Zhang, Xinran
    Peng, Mugen
    Yan, Shi
    Sun, Yaohua
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) : 6380 - 6391
  • [30] Deep-Reinforcement-Learning-Based Joint Caching and Resources Allocation for Cooperative MEC
    Zhang, Wenqian
    Zhang, Guanglin
    Mao, Shiwen
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 12203 - 12215