Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator

被引:14
|
作者
Pozdniakov, Konstantin [1 ]
Alonso, Eduardo [1 ]
Stankovic, Vladimir [1 ]
Tam, Kimberly [2 ]
Jones, Kevin [2 ]
机构
[1] City Univ London, London, England
[2] Univ Plymouth, Plymouth, Devon, England
关键词
Pentesting; audit; Q-learning; reinforcement learning; deep neural network; MODEL CHECKING;
D O I
10.1109/cybersa49311.2020.9139683
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A significant challenge in modern computer security is the growing skill gap as intruder capabilities increase, making it necessary to begin automating elements of penetration testing so analysts can contend with the growing number of cyber threats. In this paper, we attempt to assist human analysts by automating a single host penetration attack. To do so, a smart agent performs different attack sequences to find vulnerabilities in a target system. As it does so, it accumulates knowledge, learns new attack sequences and improves its own internal penetration testing logic. As a result, this agent (AgentPen for simplicity) is able to successfully penetrate hosts it has never interacted with before. A computer security administrator using this tool would receive a comprehensive, automated sequence of actions leading to a security breach, highlighting potential vulnerabilities, and reducing the amount of menial tasks a typical penetration tester would need to execute. To achieve autonomy, we apply an unsupervised machine learning algorithm, Q-learning, with an approximator that incorporates a deep neural network architecture. The security audit itself is modelled as a Markov Decision Process in order to test a number of decision-making strategies and compare their convergence to optimality. A series of experimental results is presented to show how this approach can be effectively used to automate penetration testing using a scalable, i.e. not exhaustive, and adaptive approach.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] On deep reinforcement learning security for Industrial Internet of Things
    Liu, Xing
    Yu, Wei
    Liang, Fan
    Griffith, David
    Golmie, Nada
    [J]. COMPUTER COMMUNICATIONS, 2021, 168 : 20 - 32
  • [42] An Approach to Combine the Power of Deep Reinforcement Learning with a Graph Neural Network for Routing Optimization
    Chen, Bo
    Zhu, Di
    Wang, Yuwei
    Zhang, Peng
    [J]. ELECTRONICS, 2022, 11 (03)
  • [43] A new automatic convolutional neural network based on deep reinforcement learning for fault diagnosis
    Long Wen
    You Wang
    Xinyu Li
    [J]. Frontiers of Mechanical Engineering, 2022, 17
  • [44] A new automatic convolutional neural network based on deep reinforcement learning for fault diagnosis
    Long WEN
    You WANG
    Xinyu LI
    [J]. Frontiers of Mechanical Engineering., 2022, 17 (02) - 201
  • [45] A new automatic convolutional neural network based on deep reinforcement learning for fault diagnosis
    Wen, Long
    Wang, You
    Li, Xinyu
    [J]. FRONTIERS OF MECHANICAL ENGINEERING, 2022, 17 (02)
  • [46] Automatic Compression of Neural Network with Deep Reinforcement Learning Based on Proximal Gradient Method
    Wang, Mingyi
    Tang, Jianhao
    Zhao, Haoli
    Li, Zhenni
    Xie, Shengli
    [J]. MATHEMATICS, 2023, 11 (02)
  • [47] Application of Features and Neural Network to Enhance the Performance of Deep Reinforcement Learning in Portfolio Management
    Gu, Fengchen
    Jiang, Zhengyong
    Su, Jionglong
    [J]. 2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 92 - 97
  • [48] Deep Reinforcement Learning with Fully Convolutional Neural Network to Solve An Earthwork Scheduling Problem
    Woo, Seongcheol
    Yeon, Juneyeong
    Ji, Mingi
    Moon, Il-Chul
    Park, Jinkyoo
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 4230 - 4236
  • [49] Application of Quantum Reinforcement Learning and Deep Neural Network for Radiotherapy Clinical Decision Support
    Niraula, D.
    Jamaluddin, J.
    Ten Haken, R.
    El Naqa, I.
    [J]. MEDICAL PHYSICS, 2020, 47 (06) : E313 - E313
  • [50] GROM: A generalized routing optimization method with graph neural network and deep reinforcement learning
    Ding, Mingjie
    Guo, Yingya
    Huang, Zebo
    Lin, Bin
    Luo, Huan
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2024, 229