Multiagent Deep-Reinforcement-Learning-Based Virtual Resource Allocation Through Network Function Virtualization in Internet of Things

Cited by: 35

Authors:

Shah, Hurmat Ali [1]

Zhao, Lian [1]

Affiliations:

[1] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada

Source:

IEEE INTERNET OF THINGS JOURNAL | 2021 / Vol. 8 / No. 05

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

Deep reinforcement learning (DRL); Internet of Things (IoT); machine learning (ML); network virtualization; optimization; Q-learning (QL); resource allocation;

DOI:

10.1109/JIOT.2020.3022572

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Resource allocation is a significant task in the emerging area of the Internet of Things (IoT). IoT devices are usually low-cost devices with limited computational power and limited capability for long-term communication. In this article, the network function virtualization (NFV) technique is used to access the resources of the network, and a reinforcement learning (RL) algorithm is used to solve the problem of resource allocation in IoT networks. The traffic of the IoT network uses the substrate network, which is made available through NFV, for its data transmission. The data transmission needs of the IoT network are translated into virtual requests, and service function chains (SFCs) are mapped to the substrate network to serve the requests. SFC placement under the system constraints of the IoT network is a nonconvex problem. In the proposed deep RL (DRL)-based resource allocation, the virtual layer acts as a common repository of the network resources. The optimization problem of SFC placement under the system constraints of IoT networks can be formulated as a Markov decision process (MDP). The MDP is solved through a multiagent DRL algorithm in which each agent serves an SFC. Two Q-networks are considered: one Q-network solves the SFC placement problem, while the other updates the weights of the Q-network by keeping track of long-term policy changes. The virtual agents serving SFCs interact with the environment, receive a collective reward, and update the policy by using the learned experiences. We show that the proposed scheme can solve the optimization problem of SFC placement through adequate reward design and state and action space formulation. Simulation results demonstrate that the multiagent DRL scheme outperforms the reference schemes in terms of utility gained, as measured through different network parameters.
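
The two-Q-network arrangement described in the abstract reads much like the online/target pairing used in standard deep Q-learning. The following Python sketch illustrates that general idea only; it is not the authors' implementation. The linear Q-function, the +1/-1 collective reward, the capacity-based state, and all names (`TwoNetworkAgent`, `joint_step`, `sync_every`) are assumptions made for illustration.

```python
import random


class TwoNetworkAgent:
    """One agent per SFC, with an online and a target Q-function (assumed linear)."""

    def __init__(self, n_features, n_actions, lr=0.05, gamma=0.9,
                 sync_every=20, seed=0):
        self.rng = random.Random(seed)
        self.n_actions = n_actions
        self.lr, self.gamma, self.sync_every = lr, gamma, sync_every
        # One weight row per action (candidate substrate node).
        self.online = [[0.0] * n_features for _ in range(n_actions)]
        self.target = [row[:] for row in self.online]
        self.steps = 0

    @staticmethod
    def _q(weights, state):
        # Q(s, a) = w_a . s for every action a.
        return [sum(w * s for w, s in zip(row, state)) for row in weights]

    def act(self, state, epsilon=0.1):
        # Epsilon-greedy choice of the substrate node for this agent's SFC.
        if self.rng.random() < epsilon:
            return self.rng.randrange(self.n_actions)
        q = self._q(self.online, state)
        return max(range(self.n_actions), key=q.__getitem__)

    def learn(self, state, action, reward, next_state):
        # The TD target is computed with the slowly changing target weights.
        td_target = reward + self.gamma * max(self._q(self.target, next_state))
        td_error = td_target - self._q(self.online, state)[action]
        row = self.online[action]
        for i, s in enumerate(state):
            row[i] += self.lr * td_error * s
        self.steps += 1
        if self.steps % self.sync_every == 0:
            # Periodically copy the online weights into the target copy.
            self.target = [r[:] for r in self.online]
        return td_error


def joint_step(agents, residual, demands, scale, epsilon=0.1):
    """All agents place their SFCs at once and share one collective reward."""
    state = [r / scale for r in residual]
    actions = [agent.act(state, epsilon) for agent in agents]
    new_residual = list(residual)
    for node, demand in zip(actions, demands):
        new_residual[node] -= demand
    # Hypothetical reward design: +1 if no node is overloaded, otherwise -1.
    reward = 1.0 if min(new_residual) >= 0 else -1.0
    new_residual = [max(r, 0.0) for r in new_residual]
    next_state = [r / scale for r in new_residual]
    for agent, action in zip(agents, actions):
        agent.learn(state, action, reward, next_state)
    return reward, new_residual


if __name__ == "__main__":
    capacities = [3.0, 3.0, 3.0]   # assumed substrate node capacities
    demands = [2.0, 2.0]           # assumed per-SFC resource demands
    agents = [TwoNetworkAgent(3, 3, seed=i) for i in range(len(demands))]
    residual = list(capacities)
    for _ in range(300):
        reward, residual = joint_step(agents, residual, demands, 3.0)
        if reward < 0 or min(residual) < min(demands):
            residual = list(capacities)  # start a fresh batch of requests
    print("last collective reward:", reward)
```

Keeping a separate target copy means the bootstrapped value in the update does not shift on every step, which is the usual motivation for a second network in Q-learning. Whether the paper's second network works exactly this way cannot be determined from the abstract alone.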


Pages: 3410 - 3421

Page count: 12

50 items in total

[1] Deep Multiagent Reinforcement-Learning-Based Resource Allocation for Internet of Controllable Things
Gu, Bo
Zhang, Xu
Lin, Ziqi
Alazab, Mamoun
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3066 - 3074
[2] Deep-Reinforcement-Learning-Based Spectrum Resource Management for Industrial Internet of Things
Shi, Zhaoyuan
Xie, Xianzhong
Lu, Huabing
Yang, Helin
Kadoch, Michel
Cheriet, Mohamed
[J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3476 - 3489
[3] Multiagent Deep-Reinforcement-Learning-Based Resource Allocation for Heterogeneous QoS Guarantees for Vehicular Networks
Tian, Jie
Liu, Qianqian
Zhang, Haixia
Wu, Dalei
[J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (03) : 1683 - 1695
[4] Deep-Reinforcement-Learning-Based Production Scheduling in Industrial Internet of Things
Luo, Zihui
Jiang, Chengling
Liu, Liang
Zheng, Xiaolong
Ma, Huadong
Dong, Fang
Li, Fucun
[J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (22) : 19725 - 19739
[5] Deep-Reinforcement-Learning-Based Energy-Efficient Resource Management for Social and Cognitive Internet of Things
Yang, Helin
Zhong, Wen-De
Chen, Chen
Alphones, Arokiaswami
Xie, Xianzhong
[J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (06) : 5677 - 5689
[6] Deep-Reinforcement-Learning-Based Resource Allocation for Cloud Gaming via Edge Computing
Deng, Xiaoheng
Zhang, Jingjing
Zhang, Honggang
Jiang, Ping
[J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (06) : 5364 - 5377
[7] Multiagent Deep-Reinforcement-Learning-Based Channel Allocation for MEO-LEO Networked Telemetry System
Zeng, Guanming
Zhan, Yafeng
Xiao, Xiaolong
[J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (06) : 10817 - 10830
[8] Multiagent Federated Reinforcement Learning for Resource Allocation in UAV-Enabled Internet of Medical Things Networks
Seid, Abegaz Mohammed
Erbad, Aiman
Abishu, Hayla Nahom
Albaseer, Abdullatif
Abdallah, Mohamed
Guizani, Mohsen
[J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (22) : 19695 - 19711
[9] Deep Reinforcement Learning-Based Resource Allocation for Satellite Internet of Things with Diverse QoS Guarantee
Tang, Siqi
Pan, Zhisong
Hu, Guyu
Wu, Yang
Li, Yunbo
[J]. SENSORS, 2022, 22 (08)
[10] Network Resource Allocation Strategy Based on Deep Reinforcement Learning
Zhang, Shidong
Wang, Chao
Zhang, Junsan
Duan, Youxiang
You, Xinhong
Zhang, Peiying
[J]. IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2020, 1 (01) : 86 - 94
