Efficient Jamming Policy Generation Method Based on Multi-Timescale Ensemble Q-Learning

被引:0
|
作者
Qian, Jialong [1 ]
Zhou, Qingsong [1 ]
Li, Zhihui [1 ]
Yang, Zhongping [1 ]
Shi, Shasha [1 ]
Xu, Zhenjia [1 ]
Xu, Qiyun [2 ]
机构
[1] Natl Univ Def Technol, Coll Elect Engn, Hefei 230037, Peoples R China
[2] PLA, Unit 93216, Beijing 100085, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
jamming policy generation; multifunctional radar; Q-learning; multi-timescale ensemble; RADAR;
D O I
10.3390/rs16173158
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
With the advancement of radar technology toward multifunctionality and cognitive capabilities, traditional radar countermeasures are no longer sufficient to meet the demands of countering the advanced multifunctional radar (MFR) systems. Rapid and accurate generation of the optimal jamming strategy is one of the key technologies for efficiently completing radar countermeasures. To enhance the efficiency and accuracy of jamming policy generation, an efficient jamming policy generation method based on multi-timescale ensemble Q-learning (MTEQL) is proposed in this paper. First, the task of generating jamming strategies is framed as a Markov decision process (MDP) by constructing a countermeasure scenario between the jammer and radar, while analyzing the principle radar operation mode transitions. Then, multiple structure-dependent Markov environments are created based on the real-world adversarial interactions between jammers and radars. Q-learning algorithms are executed concurrently in these environments, and their results are merged through an adaptive weighting mechanism that utilizes the Jensen-Shannon divergence (JSD). Ultimately, a low-complexity and near-optimal jamming policy is derived. Simulation results indicate that the proposed method has superior jamming policy generation performance compared with the Q-learning algorithm, in terms of the short jamming decision-making time and low average strategy error rate.
引用
收藏
页数:21
相关论文
共 50 条
  • [11] An efficient multi-agent Q-learning method based on observing the adversary agent state change
    Sun, Ruoying
    Zhao, Gang
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4169 - +
  • [12] Cognitive Electronic Jamming Decision-Making Method Based on Improved Q-Learning Algorithm
    Li, Huiqin
    Li, Yanling
    He, Chuan
    Zhan, Jianwei
    Zhang, Hui
    INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING, 2021, 2021
  • [13] A goal-conditioned policy search method with multi-timescale value function tuning
    Jiang, Zhihong
    Hu, Jiachen
    Zhao, Yan
    Huang, Xiao
    Li, Hui
    ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, 44 (04): : 549 - 559
  • [14] Intelligent Decision Method of Slope Perturbing Based on Q-Learning for Anti-Deception Jamming
    Wei, Jingjing
    Yu, Lei
    Xu, Rongqing
    2022 6TH INTERNATIONAL CONFERENCE ON IMAGING, SIGNAL PROCESSING AND COMMUNICATIONS, ICISPC, 2022, : 71 - 76
  • [15] Multi-Agent Coordination Method Based on Fuzzy Q-Learning
    Peng, Jun
    Liu, Miao
    Wu, Min
    Zhang, Xiaoyong
    Lin, Kuo-Chi
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5411 - +
  • [16] Multi-Domain Resource Scheduling for Surveillance Radar Anti-Jamming based on Q-Learning
    Yang, Tao
    Yuan, Ye
    Yi, Wei
    2023 IEEE RADAR CONFERENCE, RADARCONF23, 2023,
  • [17] Multi-Domain Resource Scheduling for Surveillance Radar Anti-Jamming based on Q-Learning
    Yang, Tao
    Yuan, Ye
    Yi, Wei
    Proceedings of the IEEE Radar Conference, 2023, 2023-May
  • [18] A Q-learning based method of optimal fault diagnostic policy with imperfect tests
    Liang, Yajun
    Xiao, Mingqing
    Tang, Xilang
    Ge, Yawei
    Wang, Xiaofei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (06) : 6013 - 6024
  • [19] A Multi-Timescale Wind Power Forecasting Method Based on Selection of Similar Days
    Wang Shen-zhe
    Gao Shan
    Zhao Xin
    Zhang Ningyu
    2016 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2016,
  • [20] Multi-objective route recommendation method based on Q-learning algorithm
    Yu, Qingying
    Xiao, Zhenxing
    Yang, Feng
    Gong, Shan
    Shi, Gege
    Chen, Chuanming
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (04) : 7009 - 7025