Joint Trajectory Control, Frequency Allocation, and Routing for UAV Swarm Networks: A Multi-Agent Deep Reinforcement Learning Approach

Cited by: 1

Authors:

Alam, Muhammad Morshed [1]

Moh, Sangman [2]

Affiliations:

[1] Amer Int Univ Bangladesh, Dept Elect & Elect Engn, Dhaka 1229, Bangladesh

[2] Chosun Univ, Dept Comp Engn, Gwangju 61452, South Korea

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024 / Vol. 23 / No. 12

Funding:

National Research Foundation of Singapore;

Keywords:

Autonomous aerial vehicles; Trajectory; Routing; Radio spectrum management; Delays; Topology; Network topology; Multi-agent deep deterministic policy gradient; frequency allocation; routing; trajectory control; UAV swarm network; AD HOC NETWORKS; POWER-CONTROL; COMMUNICATION; DESIGN;

DOI:

10.1109/TMC.2024.3403890

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Collaborative unmanned aerial vehicle (UAV) swarm networks can effectively execute various emerging missions such as surveillance and communication coverage. However, due to high mobility and constrained transmission range, packet routing suffers from mutual interference, link breakages, and unexpected delays. In such networks, routing performance is coupled with trajectory control, frequency allocation, and relay selection. In this study, we propose a joint trajectory control, frequency allocation, and packet routing (JTFR) algorithm, in which link utility is maximized by considering the link stability, signal-to-interference-plus-noise ratio, queuing delay, and residual energy of UAVs. The proposed JTFR employs an adaptive distributed multi-agent deep deterministic policy gradient coupled with swarming behavior to obtain the optimal solution. For each UAV, an actor network is established using a long short-term memory-based state representation layer containing two-hop neighbor information to adapt to the dynamic time-varying topology. Subsequently, a scalable multi-head attentional critic network is set up to adaptively adjust the actor network policy of each UAV by collaborating with neighbors. Extensive simulation results show that JTFR outperforms existing routing protocols, achieving 30-60% lower end-to-end delay, a 15-32% higher packet delivery ratio, and 20-46% lower energy consumption.
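The abstract names four inputs to the link utility but does not give its formula. As a rough illustration only, the sketch below scores candidate relays with an assumed equal-weight sum of normalized terms and picks the best one greedily. The `Link` fields, the weights, the normalization caps, and the function names are all hypothetical, and the greedy choice stands in for the learned multi-agent policy that the paper actually uses.

```python
from dataclasses import dataclass


@dataclass
class Link:
    """Hypothetical per-neighbor measurements for one candidate relay."""
    neighbor_id: int
    stability: float        # assumed in [0, 1], higher is better
    sinr_db: float          # signal-to-interference-plus-noise ratio, in dB
    queue_delay_s: float    # queuing delay at the neighbor, in seconds
    residual_energy: float  # assumed fraction of battery left, in [0, 1]


def link_utility(link, weights=(0.25, 0.25, 0.25, 0.25),
                 sinr_cap_db=30.0, delay_cap_s=1.0):
    """Assumed utility: weighted sum of four scores, each scaled to [0, 1]."""
    w_stab, w_sinr, w_delay, w_energy = weights
    # Clamp SINR to [0, cap] and scale so that a higher SINR scores higher.
    sinr_score = min(max(link.sinr_db, 0.0), sinr_cap_db) / sinr_cap_db
    # Clamp delay to [0, cap] and invert so that a shorter delay scores higher.
    delay_score = 1.0 - min(max(link.queue_delay_s, 0.0), delay_cap_s) / delay_cap_s
    return (w_stab * link.stability + w_sinr * sinr_score
            + w_delay * delay_score + w_energy * link.residual_energy)


def select_relay(links):
    """Greedy stand-in for the learned policy: pick the highest-utility link."""
    return max(links, key=link_utility).neighbor_id


candidates = [
    Link(neighbor_id=1, stability=0.90, sinr_db=12.0, queue_delay_s=0.20, residual_energy=0.5),
    Link(neighbor_id=2, stability=0.60, sinr_db=24.0, queue_delay_s=0.05, residual_energy=0.8),
    Link(neighbor_id=3, stability=0.95, sinr_db=3.0, queue_delay_s=0.60, residual_energy=0.9),
]
best = select_relay(candidates)
```

With these made-up numbers, neighbor 2 wins because its high SINR and short queue outweigh its lower stability. Changing the weights shifts that trade-off, which is the kind of coupling between metrics that the abstract describes.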


Pages: 11989 - 12005

Page count: 17

50 records in total

[11] Power Allocation and Energy Cooperation for UAV-Enabled MmWave Networks: A Multi-Agent Deep Reinforcement Learning Approach
Domingo, Mari Carmen
SENSORS, 2022, 22 (01)
[12] Joint UAV Trajectory and RadCom Task Schedule for IVNs: A Game-Embedding Multi-Agent Deep Reinforcement Learning Approach
Cheng, Sike
Lin, Xiangbo
Li, Xuanheng
Wang, Jingjing
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (01) : 181 - 196
[13] UAV Swarm Cooperative Target Search: A Multi-Agent Reinforcement Learning Approach
Hou, Yukai
Zhao, Jin
Zhang, Rongqing
Cheng, Xiang
Yang, Liuqing
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01) : 568 - 578
[14] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
Cui, Jingjing
Liu, Yuanwei
Nallanathan, Arumugam
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
[15] Routing with Graph Convolutional Networks and Multi-Agent Deep Reinforcement Learning
Bhavanasi, Sai Shreyas
Pappone, Lorenzo
Esposito, Flavio
2022 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS (IEEE NFV-SDN), 2022 : 72 - 77
[16] Joint Communication-Motion Planning for UAV Swarm against Jamming with Multi-Agent Deep Reinforcement Learning
Guo, Zhenxin
Liu, Yiming
Wang, Yipeng
Meng, Yue
Liu, Baoling
IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC, 2024
[17] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
Dai, Chen
Zhu, Kun
Hossain, Ekram
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070
[18] Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning
Guo, Delin
Tang, Lan
Zhang, Xinggan
Liang, Ying-Chang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13124 - 13138
[19] Distributed Safe Multi-Agent Reinforcement Learning: Joint Design of THz-Enabled UAV Trajectory and Channel Allocation
Termehchi, Atefeh
Syed, Aisha
Kennedy, William Sean
Erol-Kantarci, Melike
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 14172 - 14186
[20] Multi-Agent Deep Reinforcement Learning Based UAV Trajectory Optimization for Differentiated Services
Ning, Zhaolong
Yang, Yuxuan
Wang, Xiaojie
Song, Qingyang
Guo, Lei
Jamalipour, Abbas
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5818 - 5834
