Trajectory Design for Overlay UAV-to-Device Communications by Deep Reinforcement Learning

被引:6
|
作者
Wu, Fanyi [1 ]
Zhang, Hongliang [1 ,2 ]
Wu, Jianjun [1 ]
Song, Lingyang [1 ]
机构
[1] Peking Univ, Dept Elect, Beijing, Peoples R China
[2] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA
关键词
UAV-to-Device communications; cellular Internet of UAVs; trajectory design; deep reinforcement learning; NETWORKS;
D O I
10.1109/globecom38437.2019.9013973
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a cellular Internet of unmanned aerial vehicles (UAVs) where the sensory data can be transmitted to the mobile devices directly by overlaying UAV-to-Device (U2D) communications, or through the base station (BS) by cellular communications. Since the transmission modes of UAVs may influence their trajectories, we study the trajectory design problem for UAVs aiming to maximize the total utility in consideration of their transmission modes. This problem is a Markov decision problem (MDP) with a large state-action space, and thus, we propose a multi-UAV trajectory design algorithm using multi-agent deep reinforcement learning (DRL) to solve this problem. Simulation results show that our proposed algorithm can achieve a higher total utility than the single-agent method.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Cellular UAV-to-Device Communications: Trajectory Design and Mode Selection by Multi-Agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (07) : 4175 - 4189
  • [2] UAV-to-Device Underlay Communications: Age of Information Minimization by Multi-Agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Han, Zhu
    Poor, H. Vincent
    Song, Lingyang
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (07) : 4461 - 4475
  • [3] Trajectory Design for UAV Communications with No-Fly Zones by Deep Reinforcement Learning
    Liu, Zhenrong
    Zeng, Yuan
    Zhang, Wei
    Gong, Yi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [4] Cellular UAV-to-device communications: Joint trajectory, speed, and power optimisation
    Liu, Yaqin
    Wu, Fanyi
    Wu, Jianjun
    [J]. IET COMMUNICATIONS, 2021, 15 (10) : 1380 - 1391
  • [5] AoI Minimization for UAV-to-Device Underlay Communication by Multi-agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    Han, Zhu
    Poor, H. Vincent
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [6] Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications
    Zhang, Chiya
    Li, Zhukun
    He, Chunlong
    Wang, Kezhi
    Pan, Cunhua
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (09) : 2398 - 2402
  • [7] Intelligent Trajectory Design in UAV-Aided Communications With Reinforcement Learning
    Yin, Sixing
    Zhao, Shuo
    Zhao, Yifei
    Yu, F. Richard
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (08) : 8227 - 8231
  • [8] Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks
    Zhao, Nan
    Cheng, Yiqiang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [9] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
    Liu, Jintao
    Zeng, Feng
    Wang, Wei
    Sheng, Zhichao
    Wei, Xinchen
    Cumanan, Kanapathippillai
    [J]. CHINA COMMUNICATIONS, 2022, 19 (09) : 26 - 36
  • [10] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
    Jintao Liu
    Feng Zeng
    Wei Wang
    Zhichao Sheng
    Xinchen Wei
    Kanapathippillai Cumanan
    [J]. China Communications, 2022, 19 (09) : 26 - 36