Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications

被引:0
|
作者
Qin, Yunhui [1 ]
Xing, Zhifang [1 ,3 ]
Li, Xulong [2 ]
Zhang, Zhongshan [3 ]
Zhang, Haijun [2 ]
机构
[1] Univ Sci & Technol Beijing, Natl Sch Elite Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Engn & Technol Res Ctr Convergence Network, Beijing, Peoples R China
[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Autonomous aerial vehicles; Jamming; Optimization; Trajectory; Resource management; Security; Communication system security; Unmanned aerial vehicle (UAV); periodic coverage evaluation; primal-dual optimization; deep reinforcement learning; constrained Markov decision process; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; SECRECY; ENERGY;
D O I
10.1109/TVT.2024.3450956
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Considering the UAVs' energy constraints and green communication requirements, this paper proposes a periodic coverage-assisted UAV secure communication system to maximize the worst-case average achievable secrecy rate.UAV base stations serve legitimate users while UAV jammers periodically dispatch interference signals to eavesdroppers. User scheduling, UAV trajectory and power allocation are modeled as a constrained Markov decision problem with coverage evaluation constraint. Then, the joint optimization of user scheduling, UAV trajectory and power allocation is achieved by the primal-dual soft actor-critic (SAC) algorithm. Specifically, the reward critic network assesses the secrecy rate and the cost critic network fits the coverage constraint. Meanwhile, the actor network generates the user scheduling, UAV trajectory and power allocation policy while updating the dual variables. For comparison, we also adopt other deep reinforcement learning (DRL) solutions namely the SAC algorithm and the twin-delayed deep deterministic policy gradient (TD3) as well as the traditional random method and greedy method. Simulation results show that the proposed algorithm performs best in the training speed, the reward performance and the secrecy rate.
引用
收藏
页码:19641 / 19652
页数:12
相关论文
共 50 条
  • [31] RIS-Assisted UAV-D2D Communications Exploiting Deep Reinforcement Learning
    YOU Qian
    XU Qian
    YANG Xin
    ZHANG Tao
    CHEN Ming
    ZTE Communications, 2023, 21 (02) : 61 - 69
  • [32] Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints
    Ding, Yuhao
    Lavaei, Javad
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7396 - 7404
  • [33] Hybrid IRS-Assisted Secure Satellite Downlink Communications: A Fast Deep Reinforcement Learning Approach
    Ngo, Quynh Tu
    Phan, Khoa Tran
    Mahmood, Abdun
    Xiang, Wei
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2858 - 2869
  • [34] A Dual Deep Network Based Secure Deep Reinforcement Learning Method
    Zhu F.
    Wu W.
    Fu Y.-C.
    Liu Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (08): : 1812 - 1826
  • [35] Deep Reinforcement Learning for UAV-Assisted Emergency Response
    Lee, Isabella
    Babu, Vignesh
    Caesar, Matthew
    Nicol, David
    PROCEEDINGS OF THE 17TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2020), 2021, : 327 - 336
  • [36] Energy Harvesting UAV-RIS-Assisted Maritime Communications Based on Deep Reinforcement Learning Against Jamming
    Yang, Helin
    Lin, Kailong
    Xiao, Liang
    Zhao, Yifeng
    Xiong, Zehui
    Han, Zhu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 9854 - 9868
  • [37] Learning for Path Planning and Coverage Mapping in UAV-Assisted Emergency Communications
    Steiger, Juaren
    Lu, Ning
    Sorour, Sameh
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [38] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
    Liu, Jintao
    Zeng, Feng
    Wang, Wei
    Sheng, Zhichao
    Wei, Xinchen
    Cumanan, Kanapathippillai
    CHINA COMMUNICATIONS, 2022, 19 (09) : 26 - 36
  • [39] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
    Jintao Liu
    Feng Zeng
    Wei Wang
    Zhichao Sheng
    Xinchen Wei
    Kanapathippillai Cumanan
    China Communications, 2022, 19 (09) : 26 - 36
  • [40] Reinforcement Learning Based Dual-UAV Trajectory Optimization for Secure Communication
    Qian, Zhouyi
    Deng, Zhixiang
    Cai, Changchun
    Li, Haochen
    ELECTRONICS, 2023, 12 (09)