Multi-agent DRL for edge computing: A real-time proportional compute offloading

被引:3
|
作者
Jia, Kunkun [1 ]
Xia, Hui [1 ]
Zhang, Rui [1 ]
Sun, Yue [1 ]
Wang, Kai [2 ]
机构
[1] Ocean Univ China, Coll Comp Sci & Technol, Qingdao 266100, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264200, Peoples R China
基金
中国国家自然科学基金;
关键词
Computation offloading; Edge computing; Deep reinforcement learning; Orthogonal frequency division multiple-access; REINFORCEMENT; INTERNET; THINGS; AWARE;
D O I
10.1016/j.comnet.2024.110665
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the Industrial Internet of Things, devices with limited computing power and energy storage often rely on offloading tasks to edge servers for processing. However, existing methods are plagued by the high cost of device communication and unstable training processes. Consequently, Deep reinforcement learning (DRL) has emerged as a promising solution to tackle the computation offloading problem. In this paper, we propose a framework called multi-agent twin delayed shared deep deterministic policy gradient algorithm (MASTD3) based on DRL. Firstly, we formulate the task offloading conundrum as a long-term optimization problem, which aids in mitigating the challenge of deciding between local or remote task execution by a device, leading to more effective task offloading management. Secondly, we enhance MASTD3 by introducing a priority experience replay buffer mechanism and a model sample replay buffer mechanism, thus improving sample utilization and overcoming the cold-start problem associated with long-term optimization. Moreover, we refine the actor critic structure, enabling all agents to share the same critic network. This modification accelerates convergence speed during the training process and reduces computational costs during runtime. Finally, experimental results demonstrate that MASTD3 effectively addresses the proportional offloading problem, which is optimized by 44.32%, 29.26%, and 17.47% compared to DDPQN, MADDPG, and FLoadNet.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] RtDS: real-time distributed strategy for multi-period task offloading in vehicular edge computing environment
    Liu, Chunhui
    Liu, Kai
    Ren, Hualing
    Xu, Xincao
    Xie, Ruitao
    Cao, Jingjing
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17): : 12373 - 12387
  • [42] DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields
    Patel, Dipam
    Phu Pham
    Bera, Aniket
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 5050 - 5055
  • [43] Multi-agent reinforcement learning for task offloading with hybrid decision space in multi-access edge computing
    Wang, Ji
    Zhang, Miao
    Yin, Quanjun
    Yin, Lujia
    Peng, Yong
    AD HOC NETWORKS, 2025, 166
  • [44] Multi-Agent DRL-based Multi-Objective Demand Response Optimization for Real-Time Energy Management in Smart Homes
    Abishu, Hayla Nahom
    Seid, Abegaz Mohammed
    Marquez-Sanchez, Sergio
    Fernandez, Javier Hernandez
    Corchado, Juan Manuel
    Erbad, Aiman
    20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 1210 - 1217
  • [45] A Task Offloading and Resource Allocation Strategy Based on Multi-Agent Reinforcement Learning in Mobile Edge Computing
    Jiang, Guiwen
    Huang, Rongxi
    Bao, Zhiming
    Wang, Gaocai
    FUTURE INTERNET, 2024, 16 (09)
  • [46] Multi-Agent Deep Reinforcement Learning for Task Offloading in UAV-Assisted Mobile Edge Computing
    Zhao, Nan
    Ye, Zhiyang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (09) : 6949 - 6960
  • [47] Computation Offloading via Multi-Agent Deep Reinforcement Learning in Aerial Hierarchical Edge Computing Systems
    Wang, Yuanyuan
    Zhang, Chi
    Ge, Taiheng
    Pan, Miao
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5253 - 5266
  • [48] Optimization of Task Offloading Strategy for Mobile Edge Computing Based on Multi-Agent Deep Reinforcement Learning
    Lu, Haifeng
    Gu, Chunhua
    Luo, Fei
    Ding, Weichao
    Zheng, Shuai
    Shen, Yifan
    IEEE ACCESS, 2020, 8 : 202573 - 202584
  • [49] Reciprocal Velocity Obstacles for real-time multi-agent navigation
    van den Berg, Jur
    Lin, Ming
    Manocha, Dinesh
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1928 - 1935
  • [50] A multi-agent architecture for robotic systems in real-time environments
    Micacchi, C.
    Cohen, R.
    International Journal of Robotics and Automation, 2006, 21 (02): : 82 - 89