Cellular UAV-to-Device Communications: Trajectory Design and Mode Selection by Multi-Agent Deep Reinforcement Learning

被引:54
|
作者
Wu, Fanyi [1 ]
Zhang, Hongliang [1 ,2 ]
Wu, Jianjun [1 ]
Song, Lingyang [1 ]
机构
[1] Peking Univ, Dept Elect Engn, Beijing 100871, Peoples R China
[2] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
基金
中国国家自然科学基金;
关键词
Sensors; Mobile handsets; Trajectory; Internet; Quality of service; Cellular networks; Machine learning; UAV-to-Device communications; cellular Internet of UAVs; trajectory design; deep reinforcement learning; OPTIMIZATION; NETWORKS; INTERNET;
D O I
10.1109/TCOMM.2020.2986289
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the current unmanned aircraft systems (UASs) for sensing services, unmanned aerial vehicles (UAVs) transmit their sensory data to terrestrial mobile devices over the unlicensed spectrum. However, the interference from surrounding terminals is uncontrollable due to the opportunistic channel access. In this paper, we consider a cellular Internet of UAVs to guarantee the Quality-of-Service (QoS), where the sensory data can be transmitted to the mobile devices either by UAV-to-Device (U2D) communications over cellular networks, or directly through the base station (BS). Since UAVs' sensing and transmission may influence their trajectories, we study the trajectory design problem for UAVs in consideration of their sensing and transmission. This is a Markov decision problem (MDP) with a large state-action space, and thus, we utilize multi-agent deep reinforcement learning (DRL) to approximate the state-action space, and then propose a multi-UAV trajectory design algorithm to solve this problem. Simulation results show that our proposed algorithm can achieve a higher total utility than policy gradient algorithm and single-agent algorithm.
引用
下载
收藏
页码:4175 / 4189
页数:15
相关论文
共 50 条
  • [1] Trajectory Design for Overlay UAV-to-Device Communications by Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [2] UAV-to-Device Underlay Communications: Age of Information Minimization by Multi-Agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Han, Zhu
    Poor, H. Vincent
    Song, Lingyang
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (07) : 4461 - 4475
  • [3] AoI Minimization for UAV-to-Device Underlay Communication by Multi-agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    Han, Zhu
    Poor, H. Vincent
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [4] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
    Zhang, Yu
    Zhuang, Zirui
    Gao, Feifei
    Wang, Jingyu
    Han, Zhu
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [5] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [6] UAV-Enabled Secure Communications by Multi-Agent Deep Reinforcement Learning
    Zhang, Yu
    Mou, Zhiyu
    Gao, Feifei
    Jiang, Jing
    Ding, Ruijin
    Han, Zhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 11599 - 11611
  • [7] Cellular UAV-to-device communications: Joint trajectory, speed, and power optimisation
    Liu, Yaqin
    Wu, Fanyi
    Wu, Jianjun
    IET COMMUNICATIONS, 2021, 15 (10) : 1380 - 1391
  • [8] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Zhou, Xuanhan
    Xiong, Jun
    Zhao, Haitao
    Liu, Xiaoran
    Ren, Baoquan
    Zhang, Xiaochen
    Wei, Jibo
    Yin, Hao
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [9] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Xuanhan ZHOU
    Jun XIONG
    Haitao ZHAO
    Xiaoran LIU
    Baoquan REN
    Xiaochen ZHANG
    Jibo WEI
    Hao YIN
    Science China(Information Sciences), 2024, 67 (03) : 225 - 245
  • [10] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Xuanhan Zhou
    Jun Xiong
    Haitao Zhao
    Xiaoran Liu
    Baoquan Ren
    Xiaochen Zhang
    Jibo Wei
    Hao Yin
    Science China Information Sciences, 2024, 67