UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning

被引:22
|
作者
Li, Zhiwei [1 ]
Lu, Yu [2 ]
Li, Xi [2 ]
Wang, Zengguang [3 ]
Qiao, Wenxin [2 ]
Liu, Yicen [2 ]
机构
[1] Army Engn Univ, UAV Engn Dept, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China
[2] Aircraft Maintenance Ctr, Shijiazhuang Campus, Yongji 044500, Peoples R China
[3] Natl Def Univ, Shijiazhuang 050000, Hebei, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 15期
关键词
Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks; STACKELBERG GAME; TRANSMISSION;
D O I
10.1109/JIOT.2021.3062659
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The unmanned aerial vehicles (UAVs) networks are very vulnerable to smart jammers that can choose their jamming strategy based on the ongoing channel state accordingly. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for algorithms to converge quickly. This article proposes a knowledge-based RL method, which uses domain knowledge to compress the state space that the agent needs to explore and then improve the algorithm convergence speed. Specifically, we use the inertial law of the aircraft and the law of signal attenuation in free space to guide the highly efficient exploration of the UAVs in the state space. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the algorithm proposed is completely based on observable data, which is more realistic than those studies that assume the position or the channel strategy of the jammer. The simulation shows that the proposed algorithm can outperform the benchmarks of model-free RL algorithm in terms of converge speed and averaged reward.
引用
收藏
页码:12289 / 12310
页数:22
相关论文
共 50 条
  • [21] Reinforcement learning based energy efficient robot relay for unmanned aerial vehicles against smart jamming
    Xiaozhen Lu
    Jingfang Jie
    Zihan Lin
    Liang Xiao
    Jin Li
    Yanyong Zhang
    Science China Information Sciences, 2022, 65
  • [22] Playing a Strategy Game with Knowledge-Based Reinforcement Learning
    Voss V.
    Nechepurenko L.
    Schaefer R.
    Bauer S.
    SN Computer Science, 2020, 1 (2)
  • [23] Reinforcement Learning Based Topology Control for UAV Networks
    Yoo, Taehoon
    Lee, Sangmin
    Yoo, Kyeonghyun
    Kim, Hwangnam
    SENSORS, 2023, 23 (02)
  • [24] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
    Xiao, Liang
    Li, Yan
    Liu, Jinliang
    Zhao, Yifeng
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (09): : 3237 - 3257
  • [25] Deep Reinforcement Learning-Based Resource Management for UAV-Assisted Mobile Edge Computing Against Jamming
    Shao, Ziling
    Yang, Helin
    Xiao, Liang
    Su, Wei
    Chen, Yifan
    Xiong, Zehui
    IEEE Transactions on Mobile Computing, 2024, 23 (12) : 13358 - 13374
  • [26] Power control with reinforcement learning in cooperative cognitive radio networks against jamming
    Liang Xiao
    Yan Li
    Jinliang Liu
    Yifeng Zhao
    The Journal of Supercomputing, 2015, 71 : 3237 - 3257
  • [27] Reinforcement-Learning-Based Relay Mobility and Power Allocation for Underwater Sensor Networks Against Jamming
    Xiao, Liang
    Jiang, Donghua
    Chen, Ye
    Su, Wei
    Tang, Yuliang
    IEEE JOURNAL OF OCEANIC ENGINEERING, 2020, 45 (03) : 1148 - 1156
  • [28] Reinforcement Learning in Multiple-UAV Networks: Deployment and Movement Design
    Liu, Xiao
    Liu, Yuanwei
    Chen, Yue
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (08) : 8036 - 8049
  • [29] Domain Knowledge-Based Evolutionary Reinforcement Learning for Sensor Placement
    Song, Mingxuan
    Hu, Chengyu
    Gong, Wenyin
    Yan, Xuesong
    SENSORS, 2022, 22 (10)
  • [30] UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning
    Li, Bo
    Gan, Zhigang
    Chen, Daqing
    Sergey Aleksandrovich, Dyachenko
    REMOTE SENSING, 2020, 12 (22) : 1 - 20