Efficient Penetration Testing Path Planning Based on Reinforcement Learning with Episodic Memory

被引：0

作者：

Zhou, Ziqiao ^{[1
]}

Zhou, Tianyang ^{[1
]}

Xu, Jinghao ^{[2
]}

Zhu, Junhu ^{[1
]}

机构：

[1] Natl Engn Technol Res Ctr Digital Switching Syst, Henan Key Lab Informat Secur, Zhengzhou 450000, Peoples R China

[2] Informat Engn Univ, Sch Cryptog Engn, Zhengzhou 450000, Peoples R China

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 140卷 / 03期

关键词：

Intelligent penetration testing; penetration testing path planning; reinforcement learning; episodic memory; exploration strategy;

D O I：

10.32604/cmes.2023.028553

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Intelligent penetration testing is of great significance for the improvement of the security of information systems, and the critical issue is the planning of penetration test paths. In view of the difficulty for attackers to obtain complete network information in realistic network scenarios, Reinforcement Learning (RL) is a promising solution to discover the optimal penetration path under incomplete information about the target network. Existing RLbased methods are challenged by the sizeable discrete action space, which leads to difficulties in the convergence. Moreover, most methods still rely on experts' knowledge. To address these issues, this paper proposes a penetration path planning method based on reinforcement learning with episodic memory. First, the penetration testing problem is formally described in terms of reinforcement learning. To speed up the training process without specific prior knowledge, the proposed algorithm introduces episodic memory to store experienced advantageous strategies for the first time. Furthermore, the method offers an exploration strategy based on episodic memory to guide the agents in learning. The design makes full use of historical experience to achieve the purpose of reducing blind exploration and improving planning efficiency. Ultimately, comparison experiments are carried out with the existing RL-based methods. The results reveal that the proposed method has better convergence performance. The running time is reduced by more than 20%.

引用

页码：2613 / 2634

页数：22

共 50 条

[1] Reinforcement Learning for Efficient Network Penetration Testing
Ghanem, Mohamed C.
Chen, Thomas M.
INFORMATION, 2020, 11 (01)
[2] Deep Reinforcement Learning for Intelligent Penetration Testing Path Design
Yi, Junkai
Liu, Xiaoyan
APPLIED SCIENCES-BASEL, 2023, 13 (16):
[3] Efficient Deep Reinforcement Learning for Optimal Path Planning
Ren, Jing
Huang, Xishi
Huang, Raymond N.
ELECTRONICS, 2022, 11 (21)
[4] Quantum Deep Reinforcement Learning Based on Episodic Memory
Zhu X.
Hou X.
Wu S.
Zhu F.
Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2022, 51 (02): : 170 - 175
[5] Sample Efficient Reinforcement Learning Method via High Efficient Episodic Memory
Yang, Dujia
Qin, Xiaowei
Xu, Xiaodong
Li, Chensheng
Wei, Guo
IEEE ACCESS, 2020, 8 : 129274 - 129284
[6] Bionic Path Planning Fusing Episodic Memory Based on RatSLAM
Yu, Shumei
Xu, Haidong
Wu, Chong
Jiang, Xin
Sun, Rongchuan
Sun, Lining
BIOMIMETICS, 2023, 8 (01)
[7] LIRL: Latent Imagination-Based Reinforcement Learning for Efficient Coverage Path Planning
Wei, Zhenglin
Sun, Tiejiang
Zhou, Mengjie
SYMMETRY-BASEL, 2024, 16 (11):
[8] A Survey on Penetration Path Planning in Automated Penetration Testing
Chen, Ziyang
Kang, Fei
Xiong, Xiaobing
Shu, Hui
APPLIED SCIENCES-BASEL, 2024, 14 (18):
[9] Robot path planning based on deep reinforcement learning
Long, Yinxin
He, Huajin
2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
[10] Robot Path Planning Based on Deep Reinforcement Learning
Zhang, Rui
Jiang, Yuhao
Wu Fenghua
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1697 - 1701

← 1 2 3 4 5 →