Efficient Penetration Testing Path Planning Based on Reinforcement Learning with Episodic Memory

被引:0
|
作者
Zhou, Ziqiao [1 ]
Zhou, Tianyang [1 ]
Xu, Jinghao [2 ]
Zhu, Junhu [1 ]
机构
[1] Natl Engn Technol Res Ctr Digital Switching Syst, Henan Key Lab Informat Secur, Zhengzhou 450000, Peoples R China
[2] Informat Engn Univ, Sch Cryptog Engn, Zhengzhou 450000, Peoples R China
来源
关键词
Intelligent penetration testing; penetration testing path planning; reinforcement learning; episodic memory; exploration strategy;
D O I
10.32604/cmes.2023.028553
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Intelligent penetration testing is of great significance for the improvement of the security of information systems, and the critical issue is the planning of penetration test paths. In view of the difficulty for attackers to obtain complete network information in realistic network scenarios, Reinforcement Learning (RL) is a promising solution to discover the optimal penetration path under incomplete information about the target network. Existing RLbased methods are challenged by the sizeable discrete action space, which leads to difficulties in the convergence. Moreover, most methods still rely on experts' knowledge. To address these issues, this paper proposes a penetration path planning method based on reinforcement learning with episodic memory. First, the penetration testing problem is formally described in terms of reinforcement learning. To speed up the training process without specific prior knowledge, the proposed algorithm introduces episodic memory to store experienced advantageous strategies for the first time. Furthermore, the method offers an exploration strategy based on episodic memory to guide the agents in learning. The design makes full use of historical experience to achieve the purpose of reducing blind exploration and improving planning efficiency. Ultimately, comparison experiments are carried out with the existing RL-based methods. The results reveal that the proposed method has better convergence performance. The running time is reduced by more than 20%.
引用
收藏
页码:2613 / 2634
页数:22
相关论文
共 50 条
  • [1] Reinforcement Learning for Efficient Network Penetration Testing
    Ghanem, Mohamed C.
    Chen, Thomas M.
    INFORMATION, 2020, 11 (01)
  • [2] Deep Reinforcement Learning for Intelligent Penetration Testing Path Design
    Yi, Junkai
    Liu, Xiaoyan
    APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [3] Efficient Deep Reinforcement Learning for Optimal Path Planning
    Ren, Jing
    Huang, Xishi
    Huang, Raymond N.
    ELECTRONICS, 2022, 11 (21)
  • [4] Quantum Deep Reinforcement Learning Based on Episodic Memory
    Zhu X.
    Hou X.
    Wu S.
    Zhu F.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (02): : 170 - 175
  • [5] Sample Efficient Reinforcement Learning Method via High Efficient Episodic Memory
    Yang, Dujia
    Qin, Xiaowei
    Xu, Xiaodong
    Li, Chensheng
    Wei, Guo
    IEEE ACCESS, 2020, 8 : 129274 - 129284
  • [6] Bionic Path Planning Fusing Episodic Memory Based on RatSLAM
    Yu, Shumei
    Xu, Haidong
    Wu, Chong
    Jiang, Xin
    Sun, Rongchuan
    Sun, Lining
    BIOMIMETICS, 2023, 8 (01)
  • [7] LIRL: Latent Imagination-Based Reinforcement Learning for Efficient Coverage Path Planning
    Wei, Zhenglin
    Sun, Tiejiang
    Zhou, Mengjie
    SYMMETRY-BASEL, 2024, 16 (11):
  • [8] A Survey on Penetration Path Planning in Automated Penetration Testing
    Chen, Ziyang
    Kang, Fei
    Xiong, Xiaobing
    Shu, Hui
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [9] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [10] Robot Path Planning Based on Deep Reinforcement Learning
    Zhang, Rui
    Jiang, Yuhao
    Wu Fenghua
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1697 - 1701