Mean-Field Multi-Agent Reinforcement Learning for Adaptive Anti-Jamming Channel Selection in UAV Communications

被引:1
|
作者
Du, Feng [1 ]
Li, Jun [1 ]
Lin, Yan [1 ,3 ]
Wang, Zhe [2 ]
Qian, Yuwen [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV; anti-jamming; partially observable stochastic game; mean field; multi-agent reinforcement learning;
D O I
10.1109/WCSP55476.2022.10039304
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the large-scale anti-jamming UAV communication network, massive number of UAV users aim to compete for limited spectrum resources while fighting against possible external interference from malicious jammers. Specifically, each UAV-to-UAV (U2U) communication link targets at finding the optimal channel selection that maximizes its long-term expected achievable rate. We formulate the distributed multi-UAV anti-jamming problem as a partially observable stochastic game (POSG), where each UAV only has partial observability of the entire network environment due to the limited sensing capabilities. To deal with the complex interactions among large-scale UAVs, we simplify the POSG problem as a mean-field game, where each U2U link only interacts with the aggregate interference from the neighboring U2U links and the malicious jammers. We propose a soft mean-field Q learning (Soft-MFQ) algorithm to obtain the Nash equilibrium of the U2Us' channel selection policies in a model-free scenario. The simulation results show that the proposed algorithm outperforms other benchmark algorithms in terms of convergence speed and the average reward, especially when the number of UAVs is large.
引用
收藏
页码:910 / 915
页数:6
相关论文
共 50 条
  • [1] Mean-Field Multi-Agent Reinforcement Learning for Adaptive Anti-Jamming Channel Selection in UAV Communications
    Du, Feng
    Li, Jun
    Lin, Yan
    Wang, Zhe
    Qian, Yuwen
    [J]. 2022 IEEE 14th International Conference on Wireless Communications and Signal Processing, WCSP 2022, 2022, : 910 - 915
  • [2] Meta-Reinforcement Learning in Time-Varying UAV Communications: Adaptive Anti-Jamming Channel Selection
    Hu, Linzi
    Shao, Yumeng
    Qian, Yuwen
    Du, Feng
    Li, Jun
    Lin, Yan
    Wang, Zhe
    [J]. RADIOENGINEERING, 2024, 33 (03) : 417 - 431
  • [3] Multi-agent Reinforcement Learning Based Cognitive Anti-jamming
    Aref, Mohamed A.
    Jayaweera, Sudharman K.
    Machuzak, Stephen
    [J]. 2017 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2017,
  • [4] Anti-Jamming Communications in UAV Swarms: A Reinforcement Learning Approach
    Peng, Jinlin
    Zhang, Zixuan
    Wu, Qinhao
    Zhang, Bo
    [J]. IEEE ACCESS, 2019, 7 : 180532 - 180543
  • [5] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [6] Adaptive mean field multi-agent reinforcement learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Zhang, Gewei
    Zhu, Dapeng
    [J]. INFORMATION SCIENCES, 2024, 669
  • [7] A Collaborative Multi-Agent Reinforcement Learning Anti-Jamming Algorithm in Wireless Networks
    Yao, Fuqiang
    Jia, Luliang
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (04) : 1024 - 1027
  • [8] Multi-agent Learning based Anti-jamming Communications Against Cognitive Jammers
    Jayaweera, Milidu N.
    [J]. 2021 30TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC 2021), 2021, : 122 - 126
  • [9] A multi-agent reinforcement learning anti-jamming method with partially overlapping channels
    Zhang, Yunpeng
    Jia, Luliang
    Qi, Nan
    Xu, Yifan
    Chen, Xueqiang
    [J]. IET COMMUNICATIONS, 2021, 15 (19) : 2461 - 2468
  • [10] Towards reinforcement learning in UAV relay for anti-jamming maritime communications
    Liu, Chuhuan
    Zhang, Yi
    Niu, Guohang
    Jia, Luliang
    Xiao, Liang
    Luan, Jiangxia
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (06) : 1477 - 1485