Finding the Optimal Security Policies for Autonomous Cyber Operations With Competitive Reinforcement Learning

Cited by: 0

Authors:

McDonald, Garrett [1]

Li, Li [2]

Mallah, Ranwa Al [1]

Affiliations:

[1] Royal Military College of Canada, Department of Electrical and Computer Engineering, Kingston, ON K7K 7B4, Canada

[2] Defence Research and Development Canada, Toronto, ON M3K 2C9, Canada

Source:

IEEE Access | 2024 / Vol. 12

Keywords:

Autonomous cyber operations; competitive reinforcement learning; fictitious play; neural networks; multi-agent

DOI:

10.1109/ACCESS.2024.3446310

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Reinforcement Learning (RL) has been responsible for some of the most impressive advances in the field of Artificial Intelligence (AI). Research in competitive RL has shown that multiple agents competing in an adversarial environment can learn simultaneously to discover their optimal decision-making policies. Competitive RL algorithms have been used to train performant AI for a variety of games and optimization problems. Cybersecurity is a domain where emerging research in competitive RL is being considered for real-world application. To develop Automated Cyber Operations (ACO) tools using RL, various open-source environments are available to simulate network security incidents. However, existing research in these environments is typically one-sided: a Red or Blue agent is trained to optimize its decision-making against a static opponent. Competitive RL has not been attempted in these emerging environments. In this work, we trained agents using competitive RL to approximate their game-theoretic optimal policies in a simulated ACO environment. We showed that near-optimal behavior was reached gradually through fictitious play, demonstrating that these strategies can be used to approximate the optimal policies for agents involved in sophisticated sequential decision-making during a cyber attack.


Pages: 120292-120305

Number of pages: 14
