Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications

被引:0
|
作者
Qin, Yunhui [1 ]
Xing, Zhifang [1 ,3 ]
Li, Xulong [2 ]
Zhang, Zhongshan [3 ]
Zhang, Haijun [2 ]
机构
[1] Univ Sci & Technol Beijing, Natl Sch Elite Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing Engn & Technol Res Ctr Convergence Network, Beijing, Peoples R China
[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Autonomous aerial vehicles; Jamming; Optimization; Trajectory; Resource management; Security; Communication system security; Unmanned aerial vehicle (UAV); periodic coverage evaluation; primal-dual optimization; deep reinforcement learning; constrained Markov decision process; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; SECRECY; ENERGY;
D O I
10.1109/TVT.2024.3450956
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Considering the UAVs' energy constraints and green communication requirements, this paper proposes a periodic coverage-assisted UAV secure communication system to maximize the worst-case average achievable secrecy rate.UAV base stations serve legitimate users while UAV jammers periodically dispatch interference signals to eavesdroppers. User scheduling, UAV trajectory and power allocation are modeled as a constrained Markov decision problem with coverage evaluation constraint. Then, the joint optimization of user scheduling, UAV trajectory and power allocation is achieved by the primal-dual soft actor-critic (SAC) algorithm. Specifically, the reward critic network assesses the secrecy rate and the cost critic network fits the coverage constraint. Meanwhile, the actor network generates the user scheduling, UAV trajectory and power allocation policy while updating the dual variables. For comparison, we also adopt other deep reinforcement learning (DRL) solutions namely the SAC algorithm and the twin-delayed deep deterministic policy gradient (TD3) as well as the traditional random method and greedy method. Simulation results show that the proposed algorithm performs best in the training speed, the reward performance and the secrecy rate.
引用
收藏
页码:19641 / 19652
页数:12
相关论文
共 50 条
  • [41] Primal-dual differential dynamic programming: A model-based reinforcement learning for constrained dynamic optimization
    Kim, Jong Woo
    Oh, Tae Hoon
    Son, Sang Hwan
    Lee, Jong Min
    COMPUTERS & CHEMICAL ENGINEERING, 2022, 167
  • [42] Achieving Zero Constraint Violation for Concave Utility Constrained Reinforcement Learning via Primal-Dual Approach
    Bai, Qinbo
    Bedi, Amrit Singh
    Agarwal, Mridul
    Koppel, Alec
    Aggarwal, Vaneet
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 78 : 975 - 1016
  • [43] Deep Learning for Secure UAV-Assisted RIS Communication Networks
    Mughal U.A.
    Alkhrijah Y.
    Almadhor A.
    Yuen C.
    IEEE Internet of Things Magazine, 2024, 7 (02): : 38 - 44
  • [44] Two-Time Scale Tracking Control of Flexible Robots With Primal-Dual Inverse Reinforcement Learning
    Que, Xuejie
    Wang, Zhenlei
    Zhang, Yanqi
    Su, Guanghao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [45] Hybrid UAV-Enabled Secure Offloading via Deep Reinforcement Learning
    Yoo, Seonghoon
    Jeong, Seongah
    Kang, Joonhyuk
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (06) : 972 - 976
  • [46] Deep Reinforcement Learning Aided Trajectory and Power Control for Secure UAV Communication
    Wang, Zhijian
    Su, Gongchao
    Chen, Bin
    Dai, Mingjun
    Lin, Xiaohui
    PROCEEDINGS OF THE 2024 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS, ICWCSN 2024, 2024, : 74 - 79
  • [47] UAV-AIDED CELLULAR COMMUNICATIONS WITH DEEP REINFORCEMENT LEARNING AGAINST JAMMING
    Lu, Xiaozhen
    Xiao, Liang
    Dai, Canhuang
    Dai, Huaiyu
    IEEE WIRELESS COMMUNICATIONS, 2020, 27 (04) : 48 - 53
  • [48] Trajectory Design for Overlay UAV-to-Device Communications by Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [49] Trajectory Design for UAV Communications with No-Fly Zones by Deep Reinforcement Learning
    Liu, Zhenrong
    Zeng, Yuan
    Zhang, Wei
    Gong, Yi
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [50] UAV-Aided Cellular Communications with Deep Reinforcement Learning against Jamming
    Lu, Xiaozhen
    Xiao, Liang
    Dai, Canhuang
    Dai, Huaiyu
    IEEE Wireless Communications, 2020, 27 (04): : 48 - 53