Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

被引:42
|
作者
Akbari, Mohammad [1 ]
Abedi, Mohammad Reza [2 ]
Joda, Roghayeh [1 ,3 ]
Pourghasemian, Mohsen [2 ]
Mokari, Nader [2 ]
Erol-Kantarci, Melike [3 ]
机构
[1] ICT Res Inst, Commun Dept, Tehran 1439955471, Iran
[2] Tarbiat Modares Univ, Fac Elect & Comp Engn ECE, Tehran 14115111, Iran
[3] Univ Ottawa, Sch Elect Engn & Comp Sci, Ottawa, ON K1N 6N5, Canada
基金
美国国家科学基金会;
关键词
Industrial Internet of Things; Delays; Measurement; Information age; Reinforcement learning; Quality of service; Resource management; network function virtualization; age of information; deep reinforcement learning; compound actions; multi-agent; RESOURCE-ALLOCATION; INTERNET; PLACEMENT; THINGS;
D O I
10.1109/JSAC.2021.3087264
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to-end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents' collaboration.
引用
收藏
页码:2487 / 2500
页数:14
相关论文
共 50 条
  • [1] AoI-Aware Resource Scheduling for Industrial IoT with Deep Reinforcement Learning
    Li, Hongzhi
    Tang, Lin
    Chen, Shengwei
    Zheng, Libin
    Zhong, Shaohong
    [J]. ELECTRONICS, 2024, 13 (06)
  • [2] Dynamic VNF Scheduling: A Deep Reinforcement Learning Approach
    Zhang, Zixiao
    He, Fujun
    Oki, Eiji
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2023, E106B (07) : 557 - 570
  • [3] Deep Reinforcement Learning for Data Freshness-oriented Scheduling in Industrial IoT
    Li, Jiaping
    Tang, Jianhua
    Liu, Zilong
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 6271 - 6276
  • [4] Deep Reinforcement Learning for AoI Aware VNF Placement in Multiple Source Systems
    Chen, Zhenke
    Li, He
    Ota, Kaoru
    Dong, Mianxiong
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2873 - 2878
  • [5] Delay-Aware VNF Scheduling: A Reinforcement Learning Approach With Variable Action Set
    Li, Junling
    Shi, Weisen
    Zhang, Ning
    Shen, Xuemin
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 304 - 318
  • [6] Deep Reinforcement Learning for IoT Networks: Age of Information and Energy Cost Tradeoff
    Wu, Xiongwei
    Li, Xiuhua
    Li, Jun
    Ching, P. C.
    Poor, H. Vincent
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [7] Energy-aware task scheduling and offloading using deep reinforcement learning in SDN-enabled IoT network
    Sellami, Bassem
    Hakiri, Akram
    Ben Yahia, Sadok
    Berthou, Pascal
    [J]. COMPUTER NETWORKS, 2022, 210
  • [8] Energy-Aware Dynamic VNF Splitting in O-RAN Using Deep Reinforcement Learning
    Amiri, Esmaeil
    Wang, Ning
    Shojafar, Mohammad
    Tafazolli, Rahim
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (11) : 1891 - 1895
  • [9] A Trust and Energy-Aware Double Deep Reinforcement Learning Scheduling Strategy for Federated Learning on IoT Devices
    Rjoub, Gaith
    Wahab, Omar Abdel
    Bentahar, Jamal
    Bataineh, Ahmed
    [J]. SERVICE-ORIENTED COMPUTING (ICSOC 2020), 2020, 12571 : 319 - 333
  • [10] Deep reinforcement learning for blockchain in industrial IoT: A survey
    Wu, Yulei
    Wang, Zehua
    Ma, Yuxiang
    Leung, Victor C. M.
    [J]. COMPUTER NETWORKS, 2021, 191