Joint Trajectory Control, Frequency Allocation, and Routing for UAV Swarm Networks: A Multi-Agent Deep Reinforcement Learning Approach

被引:1
|
作者
Alam, Muhammad Morshed [1 ]
Moh, Sangman [2 ]
机构
[1] Amer Int Univ Bangladesh, Dept Elect & Elect Engn, Dhaka 1229, Bangladesh
[2] Chosun Univ, Dept Comp Engn, Gwangju 61452, South Korea
基金
新加坡国家研究基金会;
关键词
Autonomous aerial vehicles; Trajectory; Routing; Radio spectrum management; Delays; Topology; Network topology; Multi-agent deep deterministic policy gradient; frequency allocation; routing; trajectory control; UAV swarm network; AD HOC NETWORKS; POWER-CONTROL; COMMUNICATION; DESIGN;
D O I
10.1109/TMC.2024.3403890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collaborative unmanned aerial vehicle (UAV) swarm networks can effectively execute various emerging missions such as surveillance and communication coverage. However, due to high mobility and constrained transmission range, packet routing encounters mutual interferences, link breakages, and unexpected delays. In such networks, routing performance is coupled with trajectory control, frequency allocation, and relay selection. In this study, we propose a joint trajectory control, frequency allocation, and packet routing (JTFR) algorithm, in which link utility is maximized by considering the link stability, signal-to-interference-plus-noise ratio, queuing delay, and residual energy of UAVs. The proposed JTFR employs adaptive distributed multi-agent deep deterministic policy gradient coupled with the swarming behavior to obtain the optimal solution. For each UAV, an actor network is established by utilizing a long short-term memory-based state representation layer containing two-hop neighbor information to adopt the dynamic time-varying topology. Subsequently, a scalable multi-head attentional critic network is set up to adaptively adjust the actor network policy of each UAV by collaborating with neighbors. The extensive simulation results show that JTFR outperforms existing routing protocols by 30-60% less end-to-end delay, 15-32% better packet delivery ratio, and 20-46% less energy consumption.
引用
收藏
页码:11989 / 12005
页数:17
相关论文
共 50 条
  • [41] Multi-Agent Deep Reinforcement Learning-Empowered Channel Allocation in Vehicular Networks
    Kumar, Anitha Saravana
    Zhao, Lian
    Fernando, Xavier
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (02) : 1726 - 1736
  • [42] Joint Frequency Assignment and Power Allocation Based on Multi-Agent Deep Reinforcement Learning for Multi-Beam Satellite Systems
    Li, Yuanjun
    Yang, Dewei
    Yang, Haowen
    Kuang, Jingming
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [43] Coordinated Multi-Agent Reinforcement Learning for Swarm Battery Control
    Ebell, Niklas
    Pruckner, Marco
    2018 IEEE CANADIAN CONFERENCE ON ELECTRICAL & COMPUTER ENGINEERING (CCECE), 2018,
  • [44] Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking
    Xia, Zhaoyue
    Du, Jun
    Wang, Jingjing
    Jiang, Chunxiao
    Ren, Yong
    Li, Gang
    Han, Zhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (01) : 931 - 945
  • [45] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [46] UAV Frequency-based Crowdsensing Using Grouping Multi-agent Deep Reinforcement Learning
    Cui ZHANG
    En WANG
    Funing YANG
    Yongjian YANG
    Nan JIANG
    计算机科学, 2023, 50 (02) : 57 - 68
  • [47] Improving Cooperative Multi-Target Tracking Control for UAV Swarm Using Multi-Agent Reinforcement Learning
    Yue, Longfei
    Lv, Maolong
    Yan, Mengda
    Zhao, Xiaoru
    Wu, Ao
    Li, Leyan
    Zuo, Jialiang
    2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, : 179 - 186
  • [48] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
    Koratamaddi, Prahlad
    Wadhwani, Karan
    Gupta, Mridul
    Sanjeevi, Sriram G.
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410
  • [49] Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective
    Lee, Seungmin
    Lim, Suhyeon
    Chae, Seong Ho
    Jung, Bang Chul
    Park, Chan Yi
    Lee, Howon
    IEEE ACCESS, 2022, 10 : 39555 - 39565
  • [50] Multi-Agent Low-Bias Reinforcement Learning for Resource Allocation in UAV-Assisted Networks
    Zhou, Shiyang
    Cheng, Yufan
    Lei, Xia
    2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 1011 - 1016