An Intelligent Strategy Decision Method for Collaborative Jamming Based on Hierarchical Multi-Agent Reinforcement Learning

被引:0
|
作者
Zhang, Wenxu [1 ,2 ,3 ]
Zhao, Tong [1 ,2 ,3 ]
Zhao, Zhongkai [1 ,2 ,3 ]
Wang, Yajie [1 ,2 ,3 ]
Liu, Feiran [4 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Marine Commun & Informat Technol, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[3] Harbin Engn Univ, AVIC United Technol Ctr Electromagnet Spectrum Col, Harbin 150001, Heilongjiang, Peoples R China
[4] Wright State Univ, Dept Elect Engn, Dayton, OH 45435 USA
关键词
Jamming; Decision making; Radar; Frequency diversity; Reinforcement learning; Training; Time-frequency analysis; Cooperative jamming decision-making; resource allocation; hierarchical reinforcement learning; multi-agent reinforcement learning; prioritized experience replay; WAVE-FORM; RADAR; ALLOCATION;
D O I
10.1109/TCCN.2024.3373640
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of intelligent cooperative jamming decision-making against frequency agility and frequency diversity in cognitive electronic warfare, an intelligent cooperative jamming strategy decision-making method based on hierarchical multi-agent reinforcement learning is proposed. The multi-agent markov decision process (MDP) is used to construct the multi-jammer cooperative decision-making process. The cooperative jamming decision-making in frequency domain (FD-CJDM) model is established. The design idea of hierarchical reinforcement learning (HRL) is introduced. In order to find the optimal strategy, a double-depth Q-network based on the prioritized experience replay (PER-DDQN) optimization method of sum tree structure is adopted. The performance of FD-CJDM model based on PER-DDQN is simulated. Simulation results show that the proposed PER-DDQN method is obviously superior to deep Q network (DQN) method in action estimation, and its convergence performance is faster than that of double-depth Q network (DDQN). In addition, the intelligent decision-making method of cooperative jamming proposed in this paper can fomulate the frequency domain parameter decision-making strategy according to the order of real-time detected radar threats, which effectively realizes the design of intelligent decision-making in frequency domain.
引用
收藏
页码:1467 / 1480
页数:14
相关论文
共 50 条
  • [31] Collaborative Decision-making in Heterogeneous UAV Swarms based on Multi-agent Deep Reinforcement Learning
    Yang, Feng
    Li, Zhi
    Fu, Jiahao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2138 - 2145
  • [32] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    APPLIED ENERGY, 2023, 332
  • [33] Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Bohmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 80 - 92
  • [34] A collaborative optimization strategy for computing offloading and resource allocation based on multi-agent deep reinforcement learning
    Jiang, Yingying
    Mao, Yuxuan
    Wu, Gaoxiang
    Cai, Zhenhua
    Hao, Yixue
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103
  • [35] Research on Integrated Control Strategy for Highway Merging Bottlenecks Based on Collaborative Multi-Agent Reinforcement Learning
    Du, Juan
    Yu, Anshuang
    Zhou, Hao
    Jiang, Qianli
    Bai, Xueying
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [36] A Multi-agent Reinforcement Learning Method for Swarm Robots in Space Collaborative Exploration
    Huang, Yixin
    Wu, Shufan
    Mu, Zhongcheng
    Long, Xiangyu
    Chu, Sunhao
    Zhao, Guohong
    2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 139 - 144
  • [37] Intelligent Anti-Jamming Communication for Wireless Sensor Networks: A Multi-Agent Reinforcement Learning Approach
    Zhou Q.
    Li Y.
    Niu Y.
    Niu, Yingtao (niuyingtao78@163.com), 2021, Institute of Electrical and Electronics Engineers Inc. (02): : 775 - 784
  • [38] HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy
    Zhang, Hancheng
    Li, Guozheng
    Liu, Chi Harold
    Wang, Guoren
    Tang, Jian
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3239 - 3248
  • [39] Train rescheduling method based on multi-agent reinforcement learning
    Cao, Yuli
    Xu, Zhongwei
    Mei, Meng
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 301 - 305
  • [40] Multi-Vehicle Collaborative Lane Changing Based on Multi-Agent Reinforcement Learning
    Zhang, Xiang
    Li, Shihao
    Wang, Boyang
    Xue, Mingxuan
    Li, Zhiwei
    Liu, Haiou
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1214 - 1221