Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

Cited by: 42

Authors:

Akbari, Mohammad ^{[1]}

Abedi, Mohammad Reza ^{[2]}

Joda, Roghayeh ^{[1,3]}

Pourghasemian, Mohsen ^{[2]}

Mokari, Nader ^{[2]}

Erol-Kantarci, Melike ^{[3]}

Affiliations:

[1] ICT Res Inst, Commun Dept, Tehran 1439955471, Iran

[2] Tarbiat Modares Univ, Fac Elect & Comp Engn ECE, Tehran 14115111, Iran

[3] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada

Source:

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2021 / Vol. 39 / No. 08

Funding:

National Science Foundation (USA);

Keywords:

Industrial Internet of Things; Delays; Measurement; Information age; Reinforcement learning; Quality of service; Resource management; network function virtualization; age of information; deep reinforcement learning; compound actions; multi-agent; RESOURCE-ALLOCATION; INTERNET; PLACEMENT; THINGS;

DOI:

10.1109/JSAC.2021.3087264

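Chinese Library Classification number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Subject classification code:

0808 ; 0809 ;

Abstract: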

In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard, and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. In this paper, we first utilize a single-agent, low-complexity, compound-action actor-critic RL scheme to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to-end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that the single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, the multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to learn because the agents must collaborate.

Pages: 2487 - 2500

Number of pages: 14

50 items in total

[1] AoI-Aware Resource Scheduling for Industrial IoT with Deep Reinforcement Learning
Li, Hongzhi
Tang, Lin
Chen, Shengwei
Zheng, Libin
Zhong, Shaohong
[J]. ELECTRONICS, 2024, 13 (06)
[2] Dynamic VNF Scheduling: A Deep Reinforcement Learning Approach
Zhang, Zixiao
He, Fujun
Oki, Eiji
[J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (07) : 557 - 570
[3] Deep Reinforcement Learning for Data Freshness-oriented Scheduling in Industrial IoT
Li, Jiaping
Tang, Jianhua
Liu, Zilong
[J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 6271 - 6276
[4] Deep Reinforcement Learning for AoI Aware VNF Placement in Multiple Source Systems
Chen, Zhenke
Li, He
Ota, Kaoru
Dong, Mianxiong
[J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2873 - 2878
[5] Delay-Aware VNF Scheduling: A Reinforcement Learning Approach With Variable Action Set
Li, Junling
Shi, Weisen
Zhang, Ning
Shen, Xuemin
[J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 304 - 318
[6] Deep Reinforcement Learning for IoT Networks: Age of Information and Energy Cost Tradeoff
Wu, Xiongwei
Li, Xiuhua
Li, Jun
Ching, P. C.
Poor, H. Vincent
[J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[7] Energy-aware task scheduling and offloading using deep reinforcement learning in SDN-enabled IoT network
Sellami, Bassem
Hakiri, Akram
Ben Yahia, Sadok
Berthou, Pascal
[J]. COMPUTER NETWORKS, 2022, 210
[8] Energy-Aware Dynamic VNF Splitting in O-RAN Using Deep Reinforcement Learning
Amiri, Esmaeil
Wang, Ning
Shojafar, Mohammad
Tafazolli, Rahim
[J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (11) : 1891 - 1895
[9] A Trust and Energy-Aware Double Deep Reinforcement Learning Scheduling Strategy for Federated Learning on IoT Devices
Rjoub, Gaith
Wahab, Omar Abdel
Bentahar, Jamal
Bataineh, Ahmed
[J]. SERVICE-ORIENTED COMPUTING (ICSOC 2020), 2020, 12571 : 319 - 333
[10] Deep reinforcement learning for blockchain in industrial IoT: A survey
Wu, Yulei
Wang, Zehua
Ma, Yuxiang
Leung, Victor C. M.
[J]. COMPUTER NETWORKS, 2021, 191
