Efficient Penetration Testing Path Planning Based on Reinforcement Learning with Episodic Memory

被引：0

作者：

Zhou, Ziqiao ^{[1
]}

Zhou, Tianyang ^{[1
]}

Xu, Jinghao ^{[2
]}

Zhu, Junhu ^{[1
]}

机构：

[1] Natl Engn Technol Res Ctr Digital Switching Syst, Henan Key Lab Informat Secur, Zhengzhou 450000, Peoples R China

[2] Informat Engn Univ, Sch Cryptog Engn, Zhengzhou 450000, Peoples R China

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 140卷 / 03期

关键词：

Intelligent penetration testing; penetration testing path planning; reinforcement learning; episodic memory; exploration strategy;

D O I：

10.32604/cmes.2023.028553

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Intelligent penetration testing is of great significance for the improvement of the security of information systems, and the critical issue is the planning of penetration test paths. In view of the difficulty for attackers to obtain complete network information in realistic network scenarios, Reinforcement Learning (RL) is a promising solution to discover the optimal penetration path under incomplete information about the target network. Existing RLbased methods are challenged by the sizeable discrete action space, which leads to difficulties in the convergence. Moreover, most methods still rely on experts' knowledge. To address these issues, this paper proposes a penetration path planning method based on reinforcement learning with episodic memory. First, the penetration testing problem is formally described in terms of reinforcement learning. To speed up the training process without specific prior knowledge, the proposed algorithm introduces episodic memory to store experienced advantageous strategies for the first time. Furthermore, the method offers an exploration strategy based on episodic memory to guide the agents in learning. The design makes full use of historical experience to achieve the purpose of reducing blind exploration and improving planning efficiency. Ultimately, comparison experiments are carried out with the existing RL-based methods. The results reveal that the proposed method has better convergence performance. The running time is reduced by more than 20%.

引用

页码：2613 / 2634

页数：22

共 50 条

[21] Hierarchical reinforcement learning for efficient and effective automated penetration testing of large networks
Ghanem, Mohamed C.
Chen, Thomas M.
Nepomuceno, Erivelton G.
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (02) : 281 - 303
[22] Hierarchical reinforcement learning for efficient and effective automated penetration testing of large networks
Mohamed C. Ghanem
Thomas M. Chen
Erivelton G. Nepomuceno
Journal of Intelligent Information Systems, 2023, 60 : 281 - 303
[23] A Reinforcement Learning-based Path Planning for Collaborative UAVs
Rahim, Shahnila
Razaq, Mian Muaz
Chang, Shih Yu
Peng, Limei
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1938 - 1943
[24] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
Zheng, Kaiyuan
Gao, Jingpeng
Shen, Liangxi
IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
[25] Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning
Yang, Jiachen
Ni, Jingfei
Xi, Meng
Wen, Jiabao
Li, Yang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1983 - 1996
[26] Research on path planning of robot based on deep reinforcement learning
Liu, Feng
Chen, Chang
Li, Zhihua
Guan, Zhi-Hong
Wang, Hua O.
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3730 - 3734
[27] Robot path planning in dynamic environment based on reinforcement learning
Zhuang, Xiao-Dong
Meng, Qing-Chun
Wei, Tian-Bin
Wang, Xu-Zhu
Tan, Rui
Li, Xiao-Jing
Journal of Harbin Institute of Technology (New Series), 2001, 8 (03) : 253 - 255
[28] A Deep Reinforcement Learning Based Approach for AGVs Path Planning
Guo, Xinde
Ren, Zhigang
Wu, Zongze
Lai, Jialun
Zeng, Deyu
Xie, Shengli
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6833 - 6838
[29] A Reinforcement Learning Based Online Coverage Path Planning Algorithm
Carvalho, Jose Pedro
Pedro Aguiar, A.
2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 81 - 86
[30] Robot path planning in dynamic environment based on reinforcement learning
庄晓东
孟庆春
魏天滨
王旭柱
谭锐
李筱菁
Journal of Harbin Institute of Technology, 2001, (03) : 253 - 255

← 1 2 3 4 5 →