Deep Reinforcement Learning for UAV-Assisted Spectrum Sharing Under Partial Observability

被引:0
|
作者
Zhang, Sigen [1 ]
Wang, Zhe [1 ]
Gao, Guanyu [1 ]
Li, Jun [2 ]
Zhang, Jie [2 ]
Yin, Ziyan [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicle; dynamic spectrum sharing; partially observable Markov decision process; deep reinforcement learning; ENERGY-EFFICIENT; TRAJECTORY DESIGN; NETWORKS; COMMUNICATION; ALLOCATION; 5G;
D O I
10.1109/VTC2023-Fall60731.2023.10333853
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a dynamic spectrum sharing scheme in an unmanned aerial vehicle (UAV) assisted cognitive radio network. The UAV serves as a secondary base station to provide communication services to multiple secondary users (SUs) by adaptively utilizing the spatio-temporal spectrum opportunities of multiple device-to-device primary users (PUs), where each PU's spectrum occupancy follows a two-state Markov process. We jointly optimize the UAV's trajectory and user association to maximize the expectation of its cumulative energy efficiency subject to the interference constraint of the PUs. We formulate this problem as a partially observable Markov decision process (POMDP), where the UAV can only observe the spectrum occupancy status of the adjacent PUs. Due to the lack of the PUs' spectrum occupancy statistics, we propose a model-free reinforcement learning algorithm named partially observable double deep Q network (PO-DDQN) to obtain the near-optimal spectrum sharing policy. Simulation results show that our proposed algorithm outperforms the baseline policy gradient (PG) algorithm in terms of convergence speed and the UAV's energy efficiency. Additionally, the spectrum utilization efficiency can be further enhanced when the UAV has wider observation radius, or if the PUs' spectrum occupancy exhibits stronger temporal correlation.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for UAV-Assisted Spectrum Sharing: A Minority Game Approach
    Ding, Xinyu
    Zhang, Jie
    Wang, Zhe
    Wang, Xuehe
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 44 - 50
  • [2] Deep Reinforcement Learning for UAV-Assisted Emergency Response
    Lee, Isabella
    Babu, Vignesh
    Caesar, Matthew
    Nicol, David
    PROCEEDINGS OF THE 17TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2020), 2021, : 327 - 336
  • [3] Spectrum Sharing in UAV-Assisted HetNet Based on CMB-AM Multi-Agent Deep Reinforcement Learning
    Guan, Wei
    Gao, Bo
    Xiong, Ke
    Lu, Yang
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [4] Deep Reinforcement Learning Driven UAV-Assisted Edge Computing
    Zhang, Liang
    Jabbari, Bijan
    Ansari, Nirwan
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25449 - 25459
  • [5] UAV-Assisted NOMA for Enhancing ISAC: A Deep Reinforcement Learning Solution
    Amhaz, Ali
    Elhattab, Mohamed
    Sharafeddine, Sanaa
    Assi, Chadi
    IEEE COMMUNICATIONS LETTERS, 2025, 29 (02) : 249 - 253
  • [6] UAV-Assisted Wireless Energy and Data Transfer With Deep Reinforcement Learning
    Xiong, Zehui
    Zhang, Yang
    Lim, Wei Yang Bryan
    Kang, Jiawen
    Niyato, Dusit
    Leung, Cyril
    Miao, Chunyan
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (01) : 85 - 99
  • [7] Agent Modelling under Partial Observability for Deep Reinforcement Learning
    Papoudakis, Georgios
    Christianos, Filippos
    Albrecht, Stefano V.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Deep Reinforcement Learning for Fresh Data Collection in UAV-assisted IoT Networks
    Yi, Mengjie
    Wang, Xijun
    Liu, Juan
    Zhang, Yan
    Bai, Bo
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 716 - 721
  • [9] Optimizing Energy Efficiency in UAV-Assisted Networks Using Deep Reinforcement Learning
    Omoniwa, Babatunji
    Galkin, Boris
    Dusparic, Ivana
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (08) : 1590 - 1594
  • [10] Deep Reinforcement Learning for Minimizing Age-of-Information in UAV-assisted Networks
    Abd-Elmagid, Mohamed A.
    Ferdowsi, Aidin
    Dhillon, Harpreet S.
    Saad, Walid
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,