Efficient Jamming Policy Generation Method Based on Multi-Timescale Ensemble Q-Learning

被引:0
|
作者
Qian, Jialong [1 ]
Zhou, Qingsong [1 ]
Li, Zhihui [1 ]
Yang, Zhongping [1 ]
Shi, Shasha [1 ]
Xu, Zhenjia [1 ]
Xu, Qiyun [2 ]
机构
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
[2] PLA, Unit 93216, Beijing 100085, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
jamming policy generation; multifunctional radar; Q-learning; multi-timescale ensemble; RADAR;
D O I
10.3390/rs16173158
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
With the advancement of radar technology toward multifunctionality and cognitive capabilities, traditional radar countermeasures are no longer sufficient to meet the demands of countering the advanced multifunctional radar (MFR) systems. Rapid and accurate generation of the optimal jamming strategy is one of the key technologies for efficiently completing radar countermeasures. To enhance the efficiency and accuracy of jamming policy generation, an efficient jamming policy generation method based on multi-timescale ensemble Q-learning (MTEQL) is proposed in this paper. First, the task of generating jamming strategies is framed as a Markov decision process (MDP) by constructing a countermeasure scenario between the jammer and radar, while analyzing the principle radar operation mode transitions. Then, multiple structure-dependent Markov environments are created based on the real-world adversarial interactions between jammers and radars. Q-learning algorithms are executed concurrently in these environments, and their results are merged through an adaptive weighting mechanism that utilizes the Jensen-Shannon divergence (JSD). Ultimately, a low-complexity and near-optimal jamming policy is derived. Simulation results indicate that the proposed method has superior jamming policy generation performance compared with the Q-learning algorithm, in terms of the short jamming decision-making time and low average strategy error rate.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Q-Learning Based Adaptive Frequency Hopping Strategy Under Probabilistic Jamming
    Wang, Yutao
    Niu, Yingtao
    Chen, Jianzhong
    Fang, Fang
    Han, Chen
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [22] Multi-timescale voltage control for distribution system based on multi-agent deep reinforcement learning
    Wu, Zhi
    Li, Yiqi
    Gu, Wei
    Dong, Zengbo
    Zhao, Jingtao
    Liu, Weiliang
    Zhang, Xiao-Ping
    Liu, Pengxiang
    Sun, Qirun
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 147
  • [23] Rapid jamming recognition based on Q-learning sampling under resource constraints
    Gao, Kejie
    Zhu, Yonggang
    Wang, Youbao
    Ge, Rong
    Wang, Hao
    2024 4th International Conference on Information Communication and Software Engineering, ICICSE 2024, 2024, : 109 - 114
  • [24] Rapid jamming recognition based on Q-learning sampling under resource constraints
    Gao, Kejie
    Zhu, Yonggang
    Wang, Youbao
    Ge, Rong
    Wang, Hao
    2024 4TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING, ICICSE 2024, 2024, : 109 - 114
  • [25] Efficient off-policy Q-learning for multi-agent systems by solving dual games
    Wang, Yan
    Xue, Huiwen
    Wen, Jiwei
    Liu, Jinfeng
    Luan, Xiaoli
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (06) : 4193 - 4212
  • [26] Multi-timescale feature extraction method of wastewater treatment process based on adaptive entropy
    Han, Honggui
    Zhao, Yaqian
    Wu, Xiaolong
    Yang, Hongyan
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2024, 76 : 264 - 271
  • [27] Q-Learning Based Two-Timescale Power Allocation for Multi-Homing Hybrid RF/VLC Networks
    Kong, Justin
    Wu, Zi-Yang
    Ismail, Muhammad
    Serpedin, Erchin
    Qaraqe, Khalid A.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (04) : 443 - 447
  • [28] Multi-timescale feature extraction method of wastewater treatment process based on adaptive entropy
    Honggui Han
    Yaqian Zhao
    Xiaolong Wu
    Hongyan Yang
    Chinese Journal of Chemical Engineering, 2024, 76 (12) : 264 - 271
  • [29] An efficient multi-timescale regulation strategy for distribution networks based on active and passive resources combined
    Li, Kewen
    Lin, Xinhao
    Zhang, Wei
    Yu, Lei
    Chen, Qianyi
    Liu, Yinliang
    Ou, Shifeng
    Xu, Min
    Li, Junhao
    FRONTIERS IN ENERGY RESEARCH, 2024, 12
  • [30] A Novel Ensemble Q-Learning Algorithm for Policy Optimization in Large-Scale Networks
    Bozkus, Talha
    Mitra, Urbashi
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 1381 - 1386