A pipelining task offloading strategy via delay-aware multi-agent reinforcement learning in Cybertwin-enabled 6G network

被引:0
|
作者
Haiwen Niu [1 ,2 ,3 ]
Luhan Wang [1 ,2 ,3 ]
Keliang Du [1 ,2 ,3 ]
Zhaoming Lu [1 ,2 ,3 ]
Xiangming Wen [1 ,2 ,3 ]
Yu Liu [1 ,2 ,3 ]
机构
[1] Beijing Laboratory of Advanced Information Networks,Beijing Univof Posts&Telecom
[2] Beijing Key Lab of Network System Architecture and Convergence,Beijing Univof Posts&Telecom
[3] School of Information and Communication Engineering,Beijing Univof
关键词
D O I
暂无
中图分类号
TN929.5 [移动通信]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cybertwin-enabled 6th Generation(6G) network is envisioned to support artificial intelligence-native management to meet changing demands of 6G applications. Multi-Agent Deep Reinforcement Learning(MADRL) technologies driven by Cybertwins have been proposed for adaptive task offloading strategies. However, the existence of random transmission delay between Cybertwin-driven agents and underlying networks is not considered in related works, which destroys the standard Markov property and increases the decision reaction time to reduce the task offloading strategy performance. In order to address this problem, we propose a pipelining task offloading method to lower the decision reaction time and model it as a delay-aware Markov Decision Process(MDP). Then, we design a delay-aware MADRL algorithm to minimize the weighted sum of task execution latency and energy consumption. Firstly, the state space is augmented using the lastly-received state and historical actions to rebuild the Markov property. Secondly, Gate Transformer-XL is introduced to capture historical actions' importance and maintain the consistent input dimension dynamically changed due to random transmission delays. Thirdly, a sampling method and a new loss function with the difference between the current and target state value and the difference between real state-action value and augmented state-action value are designed to obtain state transition trajectories close to the real ones. Numerical results demonstrate that the proposed methods are effective in reducing reaction time and improving the task offloading performance in the random-delay Cybertwin-enabled 6G networks.
引用
收藏
页码:92 / 105
页数:14
相关论文
共 31 条
  • [21] On Learning Intrinsic Rewards for Faster Multi-Agent Reinforcement Learning based MAC Protocol Design in 6G Wireless Networks
    Miuccio, Luciano
    Riolo, Salvatore
    Bennis, Mehdi
    Panno, Daniela
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 466 - 471
  • [22] Multi-Agent Deep Reinforcement Learning Based Dynamic Task Offloading in a Device-to-Device Mobile-Edge Computing Network to Minimize Average Task Delay with Deadline Constraints
    He, Huaiwen
    Yang, Xiangdong
    Mi, Xin
    Shen, Hong
    Liao, Xuefeng
    SENSORS, 2024, 24 (16)
  • [23] A Delay-Optimal Task Scheduling Strategy for Vehicle Edge Computing Based on the Multi-Agent Deep Reinforcement Learning Approach
    Nie, Xuefang
    Yan, Yunhui
    Zhou, Tianqing
    Chen, Xingbang
    Zhang, Dingding
    ELECTRONICS, 2023, 12 (07)
  • [24] Distributed Channel Allocation for Mobile 6G Subnetworks via Multi-Agent Deep Q-Learning
    Adeogun, Ramoni
    Berardinelli, Gilberto
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [25] An intelligent task offloading method based on multi-agent deep reinforcement learning in ultra-dense heterogeneous network with mobile edge computing
    Pang, Shanchen
    Wang, Teng
    Gui, Haiyuan
    He, Xiao
    Hou, Lili
    COMPUTER NETWORKS, 2024, 250
  • [26] Reinforcement Learning Based Edge-End Collaboration for Multi-Task Scheduling in 6G Enabled Intelligent Autonomous Transport Systems
    Li, Peisong
    Xiao, Ziren
    Gao, Honghao
    Wang, Xinheng
    Wang, Ye
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [27] Deep Reinforcement Learning-Based Multi-node Collaborative Task Offloading Optimization in 6G Space-Air-Ground Integrated Networks
    Li, Wang
    Chen, Xin
    Jiao, Libo
    Ning, Wangzhong
    Zhu, Wenwu
    MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, MOBIQUITOUS 2023, PT I, 2024, 593 : 313 - 332
  • [28] Spatio-Temporal Cellular Network Traffic Prediction Using Multi-Task Deep Learning for AI-Enabled 6G
    Sun, Xiaochuan
    Wei, Biao
    Gao, Jiahui
    Cao, Difei
    Li, Zhigang
    Li, Yingqi
    Journal of Beijing Institute of Technology (English Edition), 2022, 31 (05): : 441 - 453
  • [29] Spatio-Temporal Cellular Network Traffic Prediction Using Multi-Task Deep Learning for AI-Enabled 6G
    Xiaochuan Sun
    Biao Wei
    Jiahui Gao
    Difei Cao
    Zhigang Li
    Yingqi Li
    Journal of Beijing Institute of Technology, 2022, 31 (05) : 441 - 453
  • [30] Hybrid Multi-Agent Deep Reinforcement Learning for Active-IRS-Based Rate Maximization Over 6G UAV Mobile Wireless Networks
    Yi, Shuming
    Wang, Fei
    Zhang, Xi
    MILCOM 2023 - 2023 IEEE MILITARY COMMUNICATIONS CONFERENCE, 2023,