Joint Trajectory Control, Frequency Allocation, and Routing for UAV Swarm Networks: A Multi-Agent Deep Reinforcement Learning Approach

被引：1

作者：

Alam, Muhammad Morshed ^{[1
]}

Moh, Sangman ^{[2
]}

机构：

[1] Amer Int Univ Bangladesh, Dept Elect & Elect Engn, Dhaka 1229, Bangladesh

[2] Chosun Univ, Dept Comp Engn, Gwangju 61452, South Korea

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

基金：

新加坡国家研究基金会;

关键词：

Autonomous aerial vehicles; Trajectory; Routing; Radio spectrum management; Delays; Topology; Network topology; Multi-agent deep deterministic policy gradient; frequency allocation; routing; trajectory control; UAV swarm network; AD HOC NETWORKS; POWER-CONTROL; COMMUNICATION; DESIGN;

D O I：

10.1109/TMC.2024.3403890

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Collaborative unmanned aerial vehicle (UAV) swarm networks can effectively execute various emerging missions such as surveillance and communication coverage. However, due to high mobility and constrained transmission range, packet routing encounters mutual interferences, link breakages, and unexpected delays. In such networks, routing performance is coupled with trajectory control, frequency allocation, and relay selection. In this study, we propose a joint trajectory control, frequency allocation, and packet routing (JTFR) algorithm, in which link utility is maximized by considering the link stability, signal-to-interference-plus-noise ratio, queuing delay, and residual energy of UAVs. The proposed JTFR employs adaptive distributed multi-agent deep deterministic policy gradient coupled with the swarming behavior to obtain the optimal solution. For each UAV, an actor network is established by utilizing a long short-term memory-based state representation layer containing two-hop neighbor information to adopt the dynamic time-varying topology. Subsequently, a scalable multi-head attentional critic network is set up to adaptively adjust the actor network policy of each UAV by collaborating with neighbors. The extensive simulation results show that JTFR outperforms existing routing protocols by 30-60% less end-to-end delay, 15-32% better packet delivery ratio, and 20-46% less energy consumption.

引用

页码：11989 / 12005

页数：17

共 50 条

[41] Multi-Agent Deep Reinforcement Learning-Empowered Channel Allocation in Vehicular Networks
Kumar, Anitha Saravana
Zhao, Lian
Fernando, Xavier
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (02) : 1726 - 1736
[42] Joint Frequency Assignment and Power Allocation Based on Multi-Agent Deep Reinforcement Learning for Multi-Beam Satellite Systems
Li, Yuanjun
Yang, Dewei
Yang, Haowen
Kuang, Jingming
2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
[43] Coordinated Multi-Agent Reinforcement Learning for Swarm Battery Control
Ebell, Niklas
Pruckner, Marco
2018 IEEE CANADIAN CONFERENCE ON ELECTRICAL & COMPUTER ENGINEERING (CCECE), 2018,
[44] Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking
Xia, Zhaoyue
Du, Jun
Wang, Jingjing
Jiang, Chunxiao
Ren, Yong
Li, Gang
Han, Zhu
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 931 - 945
[45] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
Lv, Zefang
Xiao, Liang
Du, Yousong
Niu, Guohang
Xing, Chengwen
Xu, Wenyuan
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
[46] UAV Frequency-based Crowdsensing Using Grouping Multi-agent Deep Reinforcement Learning
Cui ZHANG
En WANG
Funing YANG
Yongjian YANG
Nan JIANG
计算机科学, 2023, 50 (02) : 57 - 68
[47] Improving Cooperative Multi-Target Tracking Control for UAV Swarm Using Multi-Agent Reinforcement Learning
Yue, Longfei
Lv, Maolong
Yan, Mengda
Zhao, Xiaoru
Wu, Ao
Li, Leyan
Zuo, Jialiang
2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 179 - 186
[48] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
Koratamaddi, Prahlad
Wadhwani, Karan
Gupta, Mridul
Sanjeevi, Sriram G.
CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410
[49] Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective
Lee, Seungmin
Lim, Suhyeon
Chae, Seong Ho
Jung, Bang Chul
Park, Chan Yi
Lee, Howon
IEEE ACCESS, 2022, 10 : 39555 - 39565
[50] Multi-Agent Low-Bias Reinforcement Learning for Resource Allocation in UAV-Assisted Networks
Zhou, Shiyang
Cheng, Yufan
Lei, Xia
2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 1011 - 1016

← 1 2 3 4 5 →