Deep Reinforcement Learning for UAV-Assisted Spectrum Sharing Under Partial Observability

Times cited: 0
Authors
Zhang, Sigen [1]
Wang, Zhe [1]
Gao, Guanyu [1]
Li, Jun [2]
Zhang, Jie [2]
Yin, Ziyan [2]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Unmanned aerial vehicle; dynamic spectrum sharing; partially observable Markov decision process; deep reinforcement learning; ENERGY-EFFICIENT; TRAJECTORY DESIGN; NETWORKS; COMMUNICATION; ALLOCATION; 5G;
DOI
10.1109/VTC2023-Fall60731.2023.10333853
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a dynamic spectrum sharing scheme for an unmanned aerial vehicle (UAV)-assisted cognitive radio network. The UAV serves as a secondary base station that provides communication services to multiple secondary users (SUs) by adaptively exploiting the spatio-temporal spectrum opportunities of multiple device-to-device primary users (PUs), where each PU's spectrum occupancy follows a two-state Markov process. We jointly optimize the UAV's trajectory and user association to maximize its expected cumulative energy efficiency subject to the PUs' interference constraint. We formulate this problem as a partially observable Markov decision process (POMDP), in which the UAV can observe the spectrum occupancy status of only the adjacent PUs. Because the PUs' spectrum occupancy statistics are unknown, we propose a model-free reinforcement learning algorithm, named partially observable double deep Q network (PO-DDQN), to obtain a near-optimal spectrum sharing policy. Simulation results show that the proposed algorithm outperforms the baseline policy gradient (PG) algorithm in both convergence speed and the UAV's energy efficiency. Additionally, spectrum utilization efficiency improves further when the UAV has a wider observation radius or when the PUs' spectrum occupancy exhibits stronger temporal correlation.
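The ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' PO-DDQN implementation, whose network architecture, reward, and parameters are not given in this record. All function names, the transition probabilities `p_stay_busy` and `p_stay_idle`, and the distance-based observation rule are assumptions made for illustration. The last function shows the generic double DQN target (the online network selects the next action, the target network evaluates it), which is the standard technique the algorithm's name refers to.

```python
import math
import random


def step_occupancy(occupied, p_stay_busy, p_stay_idle, rng):
    """Advance each PU's two-state (busy/idle) Markov chain by one slot.

    p_stay_busy and p_stay_idle are hypothetical self-transition
    probabilities; values near 1 mean strong temporal correlation.
    """
    nxt = []
    for busy in occupied:
        if busy:
            nxt.append(rng.random() < p_stay_busy)
        else:
            nxt.append(not (rng.random() < p_stay_idle))
    return nxt


def observe(uav_xy, pu_xy, occupied, radius):
    """Partial observation: occupancy of PUs within `radius`, None otherwise.

    Modelling "adjacent PUs" as a Euclidean radius is an assumption.
    """
    obs = []
    for (x, y), busy in zip(pu_xy, occupied):
        dist = math.hypot(x - uav_xy[0], y - uav_xy[1])
        obs.append(busy if dist <= radius else None)
    return obs


def double_dqn_target(reward, gamma, q_online_next, q_target_next, done=False):
    """Generic double DQN target for one transition.

    The online network's Q-values pick the greedy next action; the target
    network's Q-value for that action is used in the bootstrap term.
    """
    if done:
        return reward
    best = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best]


if __name__ == "__main__":
    rng = random.Random(0)
    pus = [(1.0, 0.0), (10.0, 0.0), (3.0, 4.0)]
    state = [True, False, True]
    for _ in range(3):
        state = step_occupancy(state, 0.9, 0.8, rng)
        print(observe((0.0, 0.0), pus, state, radius=5.0))
```

Entries that come back as `None` are the PUs whose occupancy the UAV cannot see in that slot, which is what makes the decision problem a POMDP rather than a fully observed MDP.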
Pages: 6
Related papers
50 in total
  • [31] Federated Deep Reinforcement Learning-Based Intelligent Dynamic Services in UAV-Assisted MEC
    Hou, Peng
    Jiang, Xiaohan
    Wang, Zongshan
    Liu, Sen
    Lu, Zhihui
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 20415 - 20428
  • [32] Resource optimization for UAV-assisted mobile edge computing system based on deep reinforcement learning
    Yu, Fan
    Yang, Dingcheng
    Wu, Fahui
    Wang, Yapeng
    He, Hao
    PHYSICAL COMMUNICATION, 2023, 59
  • [33] Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-Assisted Mobile Edge Computing
    Wang, Liang
    Wang, Kezhi
    Pan, Cunhua
    Xu, Wei
    Aslam, Nauman
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (10) : 3536 - 3550
  • [34] Deep Reinforcement Learning-Based Resource Allocation in Cooperative UAV-Assisted Wireless Networks
    Luong, Phuong
    Gagnon, Francois
    Tran, Le-Nam
    Labeau, Fabrice
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (11) : 7610 - 7625
  • [35] Deep Reinforcement Learning-Based Collaborative Data Collection in UAV-Assisted Underwater IoT
    Fu, Xiuwen
    Kang, Shengqi
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 1611 - 1626
  • [36] Spectrum on Demand: A Competitive Open Market Model for Spectrum Sharing for UAV-Assisted Communications
    Ansari, Rafay Iqbal
    Ashraf, Nouman
    Hassan, Syed Ali
    Deepak, C. G.
    Pervaiz, Haris
    Politis, Christos
    IEEE NETWORK, 2020, 34 (06) : 318 - 324
  • [37] Coordination in Adversarial Multi-Agent with Deep Reinforcement Learning under Partial Observability
    Diallo, Elhadji Amadou Oury
    Sugawara, Toshiharu
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 198 - 205
  • [38] UAV-assisted Internet of vehicles: A framework empowered by reinforcement learning and Blockchain
    Alagha, Ahmed
    Kadadha, Maha
    Mizouni, Rabeb
    Singh, Shakti
    Bentahar, Jamal
    Otrok, Hadi
    VEHICULAR COMMUNICATIONS, 2025, 52
  • [39] Resource Allocation in UAV-Assisted Wireless Networks Using Reinforcement Learning
    Luong, Phuong
    Gagnon, Francois
    Labeau, Fabrice
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [40] Trajectory Planning in UAV-Assisted Wireless Networks via Reinforcement Learning
    He, Simeng
    Zhang, Shangwei
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 232 - 237