Multi-agent DRL for edge computing: A real-time proportional compute offloading

被引：3

作者：

Jia, Kunkun ^{[1
]}

Xia, Hui ^{[1
]}

Zhang, Rui ^{[1
]}

Sun, Yue ^{[1
]}

Wang, Kai ^{[2
]}

机构：

[1] Ocean Univ China, Coll Comp Sci & Technol, Qingdao 266100, Peoples R China

[2] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264200, Peoples R China

来源：

COMPUTER NETWORKS | 2024年 / 252卷

基金：

中国国家自然科学基金;

关键词：

Computation offloading; Edge computing; Deep reinforcement learning; Orthogonal frequency division multiple-access; REINFORCEMENT; INTERNET; THINGS; AWARE;

D O I：

10.1016/j.comnet.2024.110665

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the Industrial Internet of Things, devices with limited computing power and energy storage often rely on offloading tasks to edge servers for processing. However, existing methods are plagued by the high cost of device communication and unstable training processes. Consequently, Deep reinforcement learning (DRL) has emerged as a promising solution to tackle the computation offloading problem. In this paper, we propose a framework called multi-agent twin delayed shared deep deterministic policy gradient algorithm (MASTD3) based on DRL. Firstly, we formulate the task offloading conundrum as a long-term optimization problem, which aids in mitigating the challenge of deciding between local or remote task execution by a device, leading to more effective task offloading management. Secondly, we enhance MASTD3 by introducing a priority experience replay buffer mechanism and a model sample replay buffer mechanism, thus improving sample utilization and overcoming the cold-start problem associated with long-term optimization. Moreover, we refine the actor critic structure, enabling all agents to share the same critic network. This modification accelerates convergence speed during the training process and reduces computational costs during runtime. Finally, experimental results demonstrate that MASTD3 effectively addresses the proportional offloading problem, which is optimized by 44.32%, 29.26%, and 17.47% compared to DDPQN, MADDPG, and FLoadNet.

引用

页数：13

共 50 条

[41] RtDS: real-time distributed strategy for multi-period task offloading in vehicular edge computing environment
Liu, Chunhui
Liu, Kai
Ren, Hualing
Xu, Xincao
Xie, Ruitao
Cao, Jingjing
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17): : 12373 - 12387
[42] DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields
Patel, Dipam
Phu Pham
Bera, Aniket
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 5050 - 5055
[43] Multi-agent reinforcement learning for task offloading with hybrid decision space in multi-access edge computing
Wang, Ji
Zhang, Miao
Yin, Quanjun
Yin, Lujia
Peng, Yong
AD HOC NETWORKS, 2025, 166
[44] Multi-Agent DRL-based Multi-Objective Demand Response Optimization for Real-Time Energy Management in Smart Homes
Abishu, Hayla Nahom
Seid, Abegaz Mohammed
Marquez-Sanchez, Sergio
Fernandez, Javier Hernandez
Corchado, Juan Manuel
Erbad, Aiman
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 1210 - 1217
[45] A Task Offloading and Resource Allocation Strategy Based on Multi-Agent Reinforcement Learning in Mobile Edge Computing
Jiang, Guiwen
Huang, Rongxi
Bao, Zhiming
Wang, Gaocai
FUTURE INTERNET, 2024, 16 (09)
[46] Multi-Agent Deep Reinforcement Learning for Task Offloading in UAV-Assisted Mobile Edge Computing
Zhao, Nan
Ye, Zhiyang
Pei, Yiyang
Liang, Ying-Chang
Niyato, Dusit
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (09) : 6949 - 6960
[47] Computation Offloading via Multi-Agent Deep Reinforcement Learning in Aerial Hierarchical Edge Computing Systems
Wang, Yuanyuan
Zhang, Chi
Ge, Taiheng
Pan, Miao
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5253 - 5266
[48] Optimization of Task Offloading Strategy for Mobile Edge Computing Based on Multi-Agent Deep Reinforcement Learning
Lu, Haifeng
Gu, Chunhua
Luo, Fei
Ding, Weichao
Zheng, Shuai
Shen, Yifan
IEEE ACCESS, 2020, 8 : 202573 - 202584
[49] Reciprocal Velocity Obstacles for real-time multi-agent navigation
van den Berg, Jur
Lin, Ming
Manocha, Dinesh
2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1928 - 1935
[50] A multi-agent architecture for robotic systems in real-time environments
Micacchi, C.
Cohen, R.
International Journal of Robotics and Automation, 2006, 21 (02): : 82 - 89

← 1 2 3 4 5 →