Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

Cited by: 35

Authors:

Shah, Hurmat Ali [1]

Zhao, Lian [1]

Affiliation:

[1] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada

Source:

IEEE INTERNET OF THINGS JOURNAL | 2021 / Vol. 8 / No. 05

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

Deep reinforcement learning (DRL); Internet of Things (IoT); machine learning (ML); network virtualization; optimization; Q-learning (QL); resource allocation;

DOI:

10.1109/JIOT.2020.3022572

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Resource allocation is a significant task in the emerging area of the Internet of Things (IoT). IoT devices are usually low-cost devices with limited computational power and capabilities for long-term communication. In this article, the network function virtualization (NFV) technique is used to access the resources of the network, and a reinforcement learning (RL) algorithm is used to solve the problem of resource allocation in IoT networks. The traffic of the IoT network uses the substrate network, which is available through NFV, for its data transmission. The data transmission needs of the IoT network are translated into virtual requests, and service function chains (SFCs) are mapped to the substrate network to serve the requests. The problem of SFC placement while meeting the system constraints of the IoT network is a nonconvex problem. In the proposed deep RL (DRL)-based resource allocation, the virtual layer acts as a common repository of the network resources. The optimization problem of SFC placement under the system constraints of IoT networks can be formulated as a Markov decision process (MDP). The MDP is solved through a multiagent DRL algorithm in which each agent serves an SFC. Two Q-networks are considered: one Q-network solves the SFC placement problem, while the other updates the weights of the Q-network by keeping track of long-term policy changes. The virtual agents serving SFCs interact with the environment, receive a reward collectively, and update the policy by using the learned experiences. We show that the proposed scheme can solve the optimization problem of SFC placement through adequate design of the reward, the state space, and the action space. Simulation results demonstrate that the multiagent DRL scheme outperforms the reference schemes in terms of utility gained as measured through different network parameters.
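
The learning loop described in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation: lookup tables stand in for the deep Q-networks, the second table is read as a periodically synchronized target copy (an interpretation of "keeping track of long-term policy changes", not something the abstract states), and the environment, the reward rule, and all names (`shared_reward`, `train`, `capacity`, `sync_every`) are hypothetical. Each agent places the VNFs of one SFC on substrate nodes, and all agents receive the same collective reward.

```python
import random


def shared_reward(choices, capacity):
    """Hypothetical collective reward: +1 for each SFC placed on a node whose
    load stays within capacity, -1 for each SFC placed on an overloaded node."""
    load = {}
    for node in choices:
        load[node] = load.get(node, 0) + 1
    return sum(1 if load[node] <= capacity else -1 for node in choices)


def train(num_agents=2, num_nodes=3, chain_len=2, capacity=1, episodes=500,
          alpha=0.1, gamma=0.9, epsilon=0.2, sync_every=20, seed=0):
    """Toy multiagent Q-learning with an online and a target table per agent.

    State = index of the VNF currently being placed; action = substrate node.
    Tables are an assumed stand-in for the deep Q-networks of the paper.
    """
    rng = random.Random(seed)
    # online[a][s][n]: agent a's value for placing VNF number s on node n.
    online = [[[0.0] * num_nodes for _ in range(chain_len)]
              for _ in range(num_agents)]
    target = [[row[:] for row in table] for table in online]

    for episode in range(episodes):
        for step in range(chain_len):
            # Every agent picks a node with an epsilon-greedy rule.
            choices = []
            for a in range(num_agents):
                if rng.random() < epsilon:
                    choices.append(rng.randrange(num_nodes))
                else:
                    q = online[a][step]
                    choices.append(q.index(max(q)))

            # All agents receive the same (collective) reward.
            reward = shared_reward(choices, capacity)
            last = step == chain_len - 1
            for a, node in enumerate(choices):
                # Bootstrap from the slowly changing target table.
                future = 0.0 if last else max(target[a][step + 1])
                td_error = reward + gamma * future - online[a][step][node]
                online[a][step][node] += alpha * td_error

        # Periodically copy the online tables into the target tables.
        if (episode + 1) % sync_every == 0:
            target = [[row[:] for row in table] for table in online]

    # Greedy placement per agent: one node index per VNF in its chain.
    return [[q.index(max(q)) for q in online[a]] for a in range(num_agents)]


placement = train()
```

Here `placement[a][s]` is the node that agent `a` would greedily choose for the `s`-th VNF of its chain. Because the agents learn independently from a shared reward, this toy setup gives no guarantee that they converge to non-conflicting placements.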

Pages: 3410 - 3421

Page count: 12

50 items in total

[21] Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Wireless Systems
Sun, Shu
Li, Xiaofeng
INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 648 - 660
[22] Novel Optical Access Network Virtualization and Dynamic Resource Allocation Algorithms for the Internet of Things
Wang, Jing
Cvijetic, Neda
Kanonakis, Konstantinos
Wang, Ting
Chang, Gee-Kung
2015 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2015
[23] Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things
Lin, Lixia
Zhou, Wen'an
Yang, Zhicheng
Liu, Jianlong
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2023, 16 (01) : 170 - 188
[24] Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things
Lixia Lin
Wen’an Zhou
Zhicheng Yang
Jianlong Liu
Peer-to-Peer Networking and Applications, 2023, 16 : 170 - 188
[25] Multiagent Deep Reinforcement Learning for Task Offloading and Resource Allocation in Cybertwin-Based Networks
Hou, Wenjing
Wen, Hong
Song, Huanhuan
Lei, Wenxin
Zhang, Wei
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (22) : 16256 - 16268
[26] Internet of Things Virtual Networks Bringing Network Virtualization to Resource-constrained Devices
Ishaq, Isam
Hoebeke, Jeroen
Moerman, Ingrid
Demeester, Piet
2012 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS, CONFERENCE ON INTERNET OF THINGS, AND CONFERENCE ON CYBER, PHYSICAL AND SOCIAL COMPUTING (GREENCOM 2012), 2012 : 293 - 300
[27] Adaptive Resource Allocation Method Based on Deep Q Network for Industrial Internet of Things
Lai, Xiaolong
Hu, Qing
Wang, Weixin
Fei, Li
Huang, Ying
IEEE ACCESS, 2020, 8 : 27426 - 27434
[28] Toward Deep Q-Network-Based Resource Allocation in Industrial Internet of Things
Liang, Fan
Yu, Wei
Liu, Xing
Griffith, David
Golmie, Nada
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) : 9138 - 9150
[29] Deep-Reinforcement-Learning-Based Mode Selection and Resource Allocation for Cellular V2X Communications
Zhang, Xinran
Peng, Mugen
Yan, Shi
Sun, Yaohua
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) : 6380 - 6391
[30] Deep-Reinforcement-Learning-Based Joint Caching and Resources Allocation for Cooperative MEC
Zhang, Wenqian
Zhang, Guanglin
Mao, Shiwen
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (07) : 12203 - 12215