Efficient Penetration Testing Path Planning Based on Reinforcement Learning with Episodic Memory

被引:0
|
作者
Zhou, Ziqiao [1 ]
Zhou, Tianyang [1 ]
Xu, Jinghao [2 ]
Zhu, Junhu [1 ]
机构
[1] Natl Engn Technol Res Ctr Digital Switching Syst, Henan Key Lab Informat Secur, Zhengzhou 450000, Peoples R China
[2] Informat Engn Univ, Sch Cryptog Engn, Zhengzhou 450000, Peoples R China
来源
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 140卷 / 03期
关键词
Intelligent penetration testing; penetration testing path planning; reinforcement learning; episodic memory; exploration strategy;
D O I
10.32604/cmes.2023.028553
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Intelligent penetration testing is of great significance for the improvement of the security of information systems, and the critical issue is the planning of penetration test paths. In view of the difficulty for attackers to obtain complete network information in realistic network scenarios, Reinforcement Learning (RL) is a promising solution to discover the optimal penetration path under incomplete information about the target network. Existing RLbased methods are challenged by the sizeable discrete action space, which leads to difficulties in the convergence. Moreover, most methods still rely on experts' knowledge. To address these issues, this paper proposes a penetration path planning method based on reinforcement learning with episodic memory. First, the penetration testing problem is formally described in terms of reinforcement learning. To speed up the training process without specific prior knowledge, the proposed algorithm introduces episodic memory to store experienced advantageous strategies for the first time. Furthermore, the method offers an exploration strategy based on episodic memory to guide the agents in learning. The design makes full use of historical experience to achieve the purpose of reducing blind exploration and improving planning efficiency. Ultimately, comparison experiments are carried out with the existing RL-based methods. The results reveal that the proposed method has better convergence performance. The running time is reduced by more than 20%.
引用
收藏
页码:2613 / 2634
页数:22
相关论文
共 50 条
  • [21] Hierarchical reinforcement learning for efficient and effective automated penetration testing of large networks
    Ghanem, Mohamed C.
    Chen, Thomas M.
    Nepomuceno, Erivelton G.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (02) : 281 - 303
  • [22] Hierarchical reinforcement learning for efficient and effective automated penetration testing of large networks
    Mohamed C. Ghanem
    Thomas M. Chen
    Erivelton G. Nepomuceno
    Journal of Intelligent Information Systems, 2023, 60 : 281 - 303
  • [23] A Reinforcement Learning-based Path Planning for Collaborative UAVs
    Rahim, Shahnila
    Razaq, Mian Muaz
    Chang, Shih Yu
    Peng, Limei
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1938 - 1943
  • [24] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaiyuan
    Gao, Jingpeng
    Shen, Liangxi
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
  • [25] Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning
    Yang, Jiachen
    Ni, Jingfei
    Xi, Meng
    Wen, Jiabao
    Li, Yang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1983 - 1996
  • [26] Research on path planning of robot based on deep reinforcement learning
    Liu, Feng
    Chen, Chang
    Li, Zhihua
    Guan, Zhi-Hong
    Wang, Hua O.
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3730 - 3734
  • [27] Robot path planning in dynamic environment based on reinforcement learning
    Zhuang, Xiao-Dong
    Meng, Qing-Chun
    Wei, Tian-Bin
    Wang, Xu-Zhu
    Tan, Rui
    Li, Xiao-Jing
    Journal of Harbin Institute of Technology (New Series), 2001, 8 (03) : 253 - 255
  • [28] A Deep Reinforcement Learning Based Approach for AGVs Path Planning
    Guo, Xinde
    Ren, Zhigang
    Wu, Zongze
    Lai, Jialun
    Zeng, Deyu
    Xie, Shengli
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6833 - 6838
  • [29] A Reinforcement Learning Based Online Coverage Path Planning Algorithm
    Carvalho, Jose Pedro
    Pedro Aguiar, A.
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 81 - 86
  • [30] Robot path planning in dynamic environment based on reinforcement learning
    庄晓东
    孟庆春
    魏天滨
    王旭柱
    谭锐
    李筱菁
    Journal of Harbin Institute of Technology, 2001, (03) : 253 - 255