Deep Reinforcement Learning for UAV-Assisted Spectrum Sharing Under Partial Observability

被引:0
|
作者
Zhang, Sigen [1 ]
Wang, Zhe [1 ]
Gao, Guanyu [1 ]
Li, Jun [2 ]
Zhang, Jie [2 ]
Yin, Ziyan [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicle; dynamic spectrum sharing; partially observable Markov decision process; deep reinforcement learning; ENERGY-EFFICIENT; TRAJECTORY DESIGN; NETWORKS; COMMUNICATION; ALLOCATION; 5G;
D O I
10.1109/VTC2023-Fall60731.2023.10333853
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a dynamic spectrum sharing scheme in an unmanned aerial vehicle (UAV) assisted cognitive radio network. The UAV serves as a secondary base station to provide communication services to multiple secondary users (SUs) by adaptively utilizing the spatio-temporal spectrum opportunities of multiple device-to-device primary users (PUs), where each PU's spectrum occupancy follows a two-state Markov process. We jointly optimize the UAV's trajectory and user association to maximize the expectation of its cumulative energy efficiency subject to the interference constraint of the PUs. We formulate this problem as a partially observable Markov decision process (POMDP), where the UAV can only observe the spectrum occupancy status of the adjacent PUs. Due to the lack of the PUs' spectrum occupancy statistics, we propose a model-free reinforcement learning algorithm named partially observable double deep Q network (PO-DDQN) to obtain the near-optimal spectrum sharing policy. Simulation results show that our proposed algorithm outperforms the baseline policy gradient (PG) algorithm in terms of convergence speed and the UAV's energy efficiency. Additionally, the spectrum utilization efficiency can be further enhanced when the UAV has wider observation radius, or if the PUs' spectrum occupancy exhibits stronger temporal correlation.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Federated deep reinforcement learning based trajectory design for UAV-assisted networks with mobile ground devices
    Gao, Yunfei
    Liu, Mingliu
    Yuan, Xiaopeng
    Hu, Yulin
    Sun, Peng
    Schmeink, Anke
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [42] UAV-assisted task offloading system using dung beetle optimization algorithm & deep reinforcement learning
    Zhang, Degan
    Zhang, Zhihao
    Zhang, Jie
    Zhang, Ting
    Zhang, Lei
    Chen, Hongtao
    AD HOC NETWORKS, 2024, 156
  • [43] Caching Placement Optimization in UAV-Assisted Cellular Networks: A Deep Reinforcement Learning-Based Framework
    Wang, Yun
    Fu, Shu
    Yao, Changhua
    Zhang, Haijun
    Yu, Fei Richard
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (08) : 1359 - 1363
  • [44] A novel energy-efficiency framework for UAV-assisted networks using adaptive deep reinforcement learning
    Seerangan, Koteeswaran
    Nandagopal, Malarvizhi
    Govindaraju, Tamilmani
    Manogaran, Nalini
    Balusamy, Balamurugan
    Selvarajan, Shitharth
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [45] UAV-Assisted Fair Communication for Mobile Networks: A Multi-Agent Deep Reinforcement Learning Approach
    Zhou, Yi
    Jin, Zhanqi
    Shi, Huaguang
    Wang, Zhangyun
    Lu, Ning
    Liu, Fuqiang
    REMOTE SENSING, 2022, 14 (22)
  • [46] Deep Reinforcement Learning-Empowered Trajectory and Resource Allocation Optimization for UAV-Assisted MEC Systems
    Sun, Haowen
    Chen, Ming
    Pan, Yijin
    Cang, Yihan
    Zhao, Jiahui
    Sun, Yuanzhi
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (07) : 1823 - 1827
  • [47] Multi-Agent Deep Reinforcement Learning for Task Offloading in UAV-Assisted Mobile Edge Computing
    Zhao, Nan
    Ye, Zhiyang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (09) : 6949 - 6960
  • [48] Task Offloading and Trajectory Control for UAV-Assisted Mobile Edge Computing Using Deep Reinforcement Learning
    Zhang, Lu
    Zhang, Zi-Yan
    Min, Luo
    Tang, Chao
    Zhang, Hong-Ying
    Wang, Ya-Hong
    Cai, Peng
    IEEE ACCESS, 2021, 9 : 53708 - 53719
  • [49] Blockchain-Based Spectrum Sharing Algorithm for UAV-Assisted Relay System
    Huang, Fukang
    Zhu, Qi
    ELECTRONICS, 2024, 13 (18)
  • [50] Edge Computing Task Offloading Optimization for a UAV-Assisted Internet of Vehicles via Deep Reinforcement Learning
    Yan, Ming
    Xiong, Rui
    Wang, Yan
    Li, Chunguo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (04) : 5647 - 5658