Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search

Cited by: 32

Authors:

Luo, Quyuan ^{[1]}

Luan, Tom H. ^{[2]}

Shi, Weisong ^{[3]}

Fan, Pingzhi ^{[1]}

Affiliations:

[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Prov Key Lab Informat Coding & Transmiss, Chengdu 611756, Peoples R China

[2] Xidian Univ, Sch Cyber Engn, Xian 710071, Peoples R China

[3] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA

Source:

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2023 / Vol. 41 / No. 02

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Search problems; Uncertainty; Edge computing; Autonomous aerial vehicles; Trajectory; Servers; Unmanned aerial vehicle; cooperative target search; edge computing; computation offloading; deep reinforcement learning (DRL); RESOURCE-ALLOCATION; INDUSTRIAL INTERNET; OPTIMIZATION; NETWORKS; VEHICLES; MODEL;

DOI:

10.1109/JSAC.2022.3228558

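Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes: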

0808 ; 0809 ;

Abstract:

Unmanned aerial vehicles (UAVs) are widely used for surveillance and monitoring to complete target search tasks. However, their short battery life and moderate computational capability hinder UAVs from processing computation-intensive tasks. Emerging edge computing technologies can alleviate this problem by offloading tasks to ground edge servers. How to evaluate the search process so as to make optimal offloading decisions and plan optimal flight trajectories is a fundamental research challenge. In this paper, we propose using the concept of uncertainty, which reflects the reliability of the target search results, to evaluate the search process. We then propose a deep reinforcement learning (DRL) technique that jointly makes optimal computation offloading decisions and flying orientation choices for multi-UAV cooperative target search. Specifically, we first formulate an uncertainty minimization problem based on the established system model. By introducing a reward function, we prove that the uncertainty minimization problem is equivalent to a reward maximization problem, which is then modeled as a Markov decision process (MDP). To obtain the optimal task offloading decisions and flying orientation choices, we propose a deep Q-network (DQN) based DRL architecture with a separated Q-network. Finally, extensive simulations validate the effectiveness of the proposed techniques, and we discuss in detail how different parameters affect the search performance.
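
The equivalence between uncertainty minimization and reward maximization mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the reward definition, so the form used here (per-step reward equal to the drop in uncertainty) is an assumption made only for illustration, and the names `rewards_from_uncertainty` and `simulate_uncertainty` are hypothetical. Under that assumption the rewards telescope, so the cumulative reward equals the initial uncertainty minus the final uncertainty, and maximizing one minimizes the other.

```python
import random


def rewards_from_uncertainty(uncertainty):
    """Per-step reward r_t = U_t - U_{t+1}.

    Assumed reward form for illustration; not taken from the paper.
    """
    return [uncertainty[t] - uncertainty[t + 1]
            for t in range(len(uncertainty) - 1)]


def simulate_uncertainty(steps, seed=0):
    """Toy uncertainty trace: each search step shrinks uncertainty
    by a random factor. This is synthetic data, not the paper's model."""
    rng = random.Random(seed)
    trace = [1.0]
    for _ in range(steps):
        trace.append(trace[-1] * rng.uniform(0.7, 1.0))
    return trace


trace = simulate_uncertainty(steps=10)
total_reward = sum(rewards_from_uncertainty(trace))

# Telescoping sum: total reward = U_0 - U_T, so for a fixed U_0,
# a larger cumulative reward means a smaller final uncertainty.
print(abs(total_reward - (trace[0] - trace[-1])) < 1e-9)
```

In a DQN setting, an agent trained to maximize this cumulative reward would, under the same assumption, be driving the final search uncertainty down; the actual state, action, and reward definitions are those given in the paper itself.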

Pages: 504 - 520

Page count: 17
