Deep Reinforcement Learning Based Latency Minimization for Mobile Edge Computing With Virtualization in Maritime UAV Communication Network

被引:101
|
作者
Liu, Ying [1 ]
Yan, Junjie [1 ]
Zhao, Xiaohui [1 ]
机构
[1] Jilin Univ, Coll Commun Engn, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Trajectory; Maritime communications; Servers; Resource management; Parallel processing; Optimization; Maritime communication; deep reinforcement learning (DRL); latency minimization; UAV trajectory design; mobile edge computing (MEC); virtual machine (VM); ENERGY-EFFICIENT; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; MANAGEMENT; COVERAGE; QOS;
D O I
10.1109/TVT.2022.3141799
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The rapid development of maritime activities has led to the emergence of more and more computation-intensive applications. In order to meet the huge demand for wireless communications in maritime environment, mobile edge computing (MEC) is considered as an effective solution to provide powerful computing capabilities for maritime terminals of resource scarcity or latency sensitive. A basic technology to implement MEC is virtual machine (VM) multiplexing, through which multi-task parallel computing on a server is realized. In this paper, a two-layer unmanned aerial vehicles (UAVs) maritime communication network with a centralized top-UAV (T-UAV) and a group of distributed bottom-UAVs (B-UAVs) is established and MEC is used on T-UAV. We aim to solve the latency minimization problem for both communication and computation in this maritime UAV swarm mobile edge computing network. We reformulate this problem into a Markov decision process (MDP), since it is a non-convex and multiply constrained but has the characteristics of MDP. Based on this MDP model, we take deep reinforcement learning (DRL) as our tool to propose a deep Q-network (DQN) and a deep deterministic policy gradient (DDPG) algorithms to optimize the trajectory of T-UAV and configuration of virtual machines (VMs). Using these two proposed algorithms, we can minimize the system latency. Simulation results show that the given solutions are valid and effective.
引用
收藏
页码:4225 / 4236
页数:12
相关论文
共 50 条
  • [31] Multiple Workflows Offloading Based on Deep Reinforcement Learning in Mobile Edge Computing
    Gao, Yongqiang
    Wang, Yanping
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT I, 2022, 13155 : 476 - 493
  • [32] Deep Reinforcement Learning-Based Server Selection for Mobile Edge Computing
    Liu, Heting
    Cao, Guohong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 13351 - 13363
  • [33] Task Offloading Optimization in Mobile Edge Computing based on Deep Reinforcement Learning
    Silva, Carlos
    Magaia, Naercio
    Grilo, Antonio
    PROCEEDINGS OF THE INT'L ACM CONFERENCE ON MODELING, ANALYSIS AND SIMULATION OF WIRELESS AND MOBILE SYSTEMS, MSWIM 2023, 2023, : 109 - 118
  • [34] Deep Reinforcement Learning Driven UAV-Assisted Edge Computing
    Zhang, Liang
    Jabbari, Bijan
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25449 - 25459
  • [35] Cooperative Multiagent Deep Reinforcement Learning Methods for UAV-Aided Mobile Edge Computing Networks
    Kim, Mintae
    Lee, Hoon
    Hwang, Sangwon
    Debbah, Merouane
    Lee, Inkyu
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (23): : 38040 - 38053
  • [36] A Deep Reinforcement Learning Approach for Collaborative Mobile Edge Computing
    Wu, Jiaqi
    Lin, Huang
    Liu, Huaize
    Gao, Lin
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 601 - 606
  • [37] Reinforcement Learning-Based Optimal Computing and Caching in Mobile Edge Network
    Qian, Yichen
    Wang, Rui
    Wu, Jun
    Tan, Bin
    Ren, Haoqi
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (10) : 2343 - 2355
  • [38] Dynamic and intelligent edge server placement based on deep reinforcement learning in mobile edge computing
    Jiang, Xiaohan
    Hou, Peng
    Zhu, Hongbin
    Li, Bo
    Wang, Zongshan
    Ding, Hongwei
    AD HOC NETWORKS, 2023, 145
  • [39] Secure Transmission for Multi-UAV-Assisted Mobile Edge Computing Based on Reinforcement Learning
    Lu, Weidang
    Mo, Yandan
    Feng, Yunqi
    Gao, Yuan
    Zhao, Nan
    Wu, Yuan
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (03): : 1270 - 1282
  • [40] Multi-UAV Mobile Edge Computing and Path Planning Platform Based on Reinforcement Learning
    Chang, Huan
    Chen, Yicheng
    Zhang, Baochang
    Doermann, David
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03): : 489 - 498