Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search

被引:32
|
作者
Luo, Quyuan [1 ]
Luan, Tom H. [2 ]
Shi, Weisong [3 ]
Fan, Pingzhi [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Prov Key Lab Informat Coding & Transmiss, Chengdu 611756, Peoples R China
[2] Xidian Univ, Sch Cyber Engn, Xian 710071, Peoples R China
[3] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA
基金
中国国家自然科学基金;
关键词
Task analysis; Search problems; Uncertainty; Edge computing; Autonomous aerial vehicles; Trajectory; Servers; Unmanned aerial vehicle; cooperative target search; edge computing; computation offloading; deep reinforcement learning (DRL); RESOURCE-ALLOCATION; INDUSTRIAL INTERNET; OPTIMIZATION; NETWORKS; VEHICLES; MODEL;
D O I
10.1109/JSAC.2022.3228558
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unmanned aerial vehicles (UAVs) are widely used for surveillance and monitoring to complete target search tasks. However, the short battery life and moderate computational capability hinder UAVs to process computation-intensive tasks. The emerging edge computing technologies can alleviate this problem by offloading tasks to the ground edge servers. How to evaluate the search process so as to make optimal offloading decisions and make optimal flying trajectories represent fundamental research challenges. In this paper, we propose to utilize the concept of uncertainty to evaluate the search process, which reflects the reliability of the target search results. Thereafter, we propose a deep reinforcement learning (DRL) technique to jointly make optimal computation offloading decisions and flying orientation choices for multi-UAV cooperative target search. Specifically, we first formulate an uncertainty minimization problem based on the established system model. By introducing a reward function, we prove that the uncertainty minimization problem is equivalent to a reward maximization problem, which is further analyzed by a Markov decision process (MDP). To obtain the optimal task offloading decisions and flying orientation choices, a deep Q-network (DQN) based DRL architecture with a separated Q-network is then proposed. Finally, extensive simulations validate the effectiveness of the proposed techniques, and comprehensive discussions on how different parameters affect the search performance are given.
引用
收藏
页码:504 / 520
页数:17
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking
    Moon, Jiseon
    Papaioannou, Savvas
    Laoudias, Christos
    Kolios, Panayiotis
    Kim, Sunwoo
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (20) : 15441 - 15455
  • [2] Trajectory Planning and Resource Allocation for Multi-UAV Cooperative Computation
    Xu, Wenlong
    Zhang, Tiankui
    Mu, Xidong
    Liu, Yuanwei
    Wang, Yapeng
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (07) : 4305 - 4318
  • [3] Computation offloading over multi-UAV MEC network: A distributed deep reinforcement learning approach
    Wei, Dawei
    Ma, Jianfeng
    Luo, Linbo
    Wang, Yunbo
    He, Lei
    Li, Xinghua
    [J]. COMPUTER NETWORKS, 2021, 199
  • [4] Multi-UAV Adaptive Cooperative Formation Trajectory Planning Based on an Improved MATD3 Algorithm of Deep Reinforcement Learning
    Xing, Xiaojun
    Zhou, Zhiwei
    Li, Yan
    Xiao, Bing
    Xun, Yilin
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (09) : 12484 - 12499
  • [5] Multi-UAV Information Fusion and Cooperative Trajectory Optimization in Target Search
    Yao, Peng
    Wei, Xin
    [J]. IEEE SYSTEMS JOURNAL, 2022, 16 (03): : 4325 - 4333
  • [6] Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning
    Zhang, Chi Ya
    Liang, Shi Yuan
    He, Chun Long
    Wang, Ke Zhi
    [J]. Journal of Communications and Information Networks, 2022, 7 (02): : 192 - 201
  • [7] Multi-UAV Cooperative Search Planning Algorithm Based on Dynamic Target Probability Model
    Ao, Zihang
    Zhang, Yulong
    Huang, Jing
    Lin, Yichen
    Zhou, Xiaoden
    Zhang, Youmin
    [J]. 2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 543 - 548
  • [8] Multi-Agent Deep Reinforcement Learning-Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 73 - 84
  • [9] Bayesian Optimization Enhanced Deep Reinforcement Learning for Trajectory Planning and Network Formation in Multi-UAV Networks
    Gong, Shimin
    Wang, Meng
    Gu, Bo
    Zhang, Wenjie
    Dinh Thai Hoang
    Niyato, Dusit
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (08) : 10933 - 10948
  • [10] Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
    Westheider, Jonas
    Rueckin, Julius
    Popovic, Marija
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 649 - 656