Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-Assisted Mobile Edge Computing

被引:161
|
作者
Wang, Liang [1 ]
Wang, Kezhi [1 ]
Pan, Cunhua [2 ]
Xu, Wei [3 ,4 ]
Aslam, Nauman [1 ]
Nallanathan, Arumugam [2 ]
机构
[1] Northumbria Univ, Dept Comp & Informant Sci, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
[4] Zhengzhou Univ, Henan Joint Int Res Lab Intelligent Networking &, Zhengzhou 450001, Peoples R China
关键词
Trajectory; Unmanned aerial vehicles; Radio access technologies; Task analysis; Resource management; Reinforcement learning; Cats; Deep reinforcement learning; mobile edge computing; Unmanned Aerial Vehicle (UAV); trajectory control; user association; ENERGY-EFFICIENT; COMMUNICATION DESIGN; RESOURCE-ALLOCATION; FAIR COMMUNICATION; NETWORKS; OPTIMIZATION; ALTITUDE; CLOUD;
D O I
10.1109/TMC.2021.3059691
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a platform of flying mobile edge computing (F-MEC), where unmanned aerial vehicles (UAVs) serve as equipment providing computation resource, and they enable task offloading from user equipment (UE). We aim to minimize energy consumption of all UEs via optimizing user association, resource allocation and the trajectory of UAVs. To this end, we first propose a Convex optimizAtion based Trajectory control algorithm (CAT), which solves the problem in an iterative way by using block coordinate descent (BCD) method. Then, to make the real-time decision while taking into account the dynamics of the environment (i.e., UAV may take off from different locations), we propose a deep Reinforcement leArning based trajectory control algorithm (RAT). In RAT, we apply the Prioritized Experience Replay (PER) to improve the convergence of the training procedure. Different from the convex optimization based algorithm which may be susceptible to the initial points and requires iterations, RAT can be adapted to any taking off points of the UAVs and can obtain the solution more rapidly than CAT once training process has been completed. Simulation results show that the proposed CAT and RAT achieve the considerable performance and both outperform traditional algorithms.
引用
收藏
页码:3536 / 3550
页数:15
相关论文
共 50 条
  • [31] UAV-Assisted Relaying and Edge Computing: Scheduling and Trajectory Optimization
    Hu, Xiaoyan
    Wong, Kai-Kit
    Yang, Kun
    Zheng, Zhongbin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (10) : 4738 - 4752
  • [32] UAV-Assisted Wireless Powered Cooperative Mobile Edge Computing: Joint Offloading, CPU Control, and Trajectory Optimization
    Liu, Yuan
    Xiong, Ke
    Ni, Qiang
    Fan, Pingyi
    Ben Letaief, Khaled
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (04) : 2777 - 2790
  • [33] Machine Learning Driven UAV-assisted Edge Computing
    Zhang, Liang
    Jabbari, Bijan
    Ansari, Nirwan
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2220 - 2225
  • [34] Blockchain-Integrated UAV-Assisted Mobile Edge Computing: Trajectory Planning and Resource Allocation
    Wang, Die
    Jia, Yunjian
    Dong, Mianxiong
    Ota, Kaoru
    Liang, Liang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 1263 - 1275
  • [35] Deep Reinforcement Learning Based Freshness-Aware Path Planning for UAV-Assisted Edge Computing Networks with Device Mobility
    Peng, Yingsheng
    Liu, Yong
    Li, Dong
    Zhang, Han
    REMOTE SENSING, 2022, 14 (16)
  • [36] Energy-Efficient UAV-Assisted Mobile Edge Computing: Resource Allocation and Trajectory Optimization
    Li, Mushu
    Cheng, Nan
    Gao, Jie
    Wang, Yinlu
    Zhao, Lian
    Shen, Xuemin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) : 3424 - 3438
  • [37] Analysis and prediction of UAV-assisted mobile edge computing systems
    Wang, Xiong
    Yang, Zhijun
    Ding, Hongwei
    Guan, Zheng
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (12) : 21267 - 21291
  • [38] Deep Reinforcement Learning for Jointly Resource Allocation and Trajectory Planning in UAV-Assisted Networks
    Jwaifel, Arwa Mahmoud
    Van Do, Tien
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 71 - 83
  • [39] Dynamic Trajectory Design for Multi-UAV-Assisted Mobile Edge Computing
    Wang, Zhuwei
    Wang, Haowei
    Liu, Lihan
    Sun, Enchang
    Zhang, Haijun
    Li, Zhidu
    Fang, Chao
    Li, Meng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (03) : 4684 - 4697
  • [40] Collaborative Service Provisioning for UAV-Assisted Mobile Edge Computing
    Yuben QU
    Zhenhua WEI
    Zhen QIN
    Tao WU
    Jinghao MA
    Haipeng DAI
    Chao DONG
    Chinese Journal of Electronics, 2024, 33 (06) : 1504 - 1514