Finding the Optimal Security Policies for Autonomous Cyber Operations With Competitive Reinforcement Learning

被引:0
|
作者
McDonald, Garrett [1 ]
Li, Li [2 ]
Mallah, Ranwa Al [1 ]
机构
[1] Royal Mil Coll Canada, Dept Elect & Comp Engn, Kingston, ON K7K 7B4, Canada
[2] Def Res & Dev Canada, Toronto, ON M3K 2C9, Canada
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Autonomous cyber operations; competitive reinforcement learning; fictitious play; neural networks; multi-agent;
D O I
10.1109/ACCESS.2024.3446310
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement Learning (RL) has been responsible for some of the most impressive advances in the field of Artificial Intelligence (AI). Research in competitive RL has shown that multiple agents competing in an adversarial environment can learn simultaneously in order to discover their optimal decision-making policies. Competitive RL algorithms have been used to train performant AI for a variety of games and optimization problems. Cybersecurity is a domain where the emerging research in competitive RL is being considered for its real-world application. In order to develop Automated Cyber Operations (ACO) tools using RL, various open-source environments are available to simulate network security incidents. However, the existing research in these environments is typically one-sided: a Red or Blue agent is trained to optimize their decision-making against a static opponent. Competitive RL has not been attempted in these emerging environments. In this work, we trained agents using competitive RL to approximate their game theory optimal policies in a simulated ACO environment. We showed that near-optimal behavior was reached gradually through fictitious play demonstrating that these strategies can be used to approximate the optimal policies for agents involved in sophisticated sequential decision-making during a cyber attack.
引用
收藏
页码:120292 / 120305
页数:14
相关论文
共 50 条
  • [1] Autonomous Cyber Warfare Agents: Dynamic Reinforcement Learning for Defensive Cyber Operations
    Bierbrauer, David A.
    Schabinger, Robert M.
    Carlin, Caleb
    Mullin, Jonathan
    Pavlik, John A.
    Bastian, Nathaniel D.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [2] Deep Reinforcement Learning for Cyber Security
    Thanh Thi Nguyen
    Reddi, Vijay Janapa
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 3779 - 3795
  • [3] Adversarial Reinforcement Learning in a Cyber Security Simulation
    Elderman, Richard
    Pater, Leon J. J.
    Thie, Albert S.
    Drugan, Madalina M.
    Wiering, Marco A.
    [J]. ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 559 - 566
  • [4] Cyber Resilience Using Autonomous Agents and Reinforcement Learning
    Cam, Hasan
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS II, 2020, 11413
  • [5] An adversarial reinforcement learning based system for cyber security
    Xia, Song
    Qiu, Meikang
    Jiang, Hao
    [J]. 4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 227 - 230
  • [6] Cyber-security and reinforcement learning - A brief survey
    Adawadkar, Amrin Maria Khan
    Kulkarni, Nilima
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
  • [7] Causally aware reinforcement learning agents for autonomous cyber defence
    Purves, Tom
    Kyriakopoulos, Konstantinos G.
    Jenkins, Sian
    Phillips, Iain
    Dudman, Tim
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [8] Adversarial Manipulation of Reinforcement Learning Policies in Autonomous Agents
    Huang, Yonghong
    Wang, Shih-han
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [9] Towards Optimal Attacks on Reinforcement Learning Policies
    Russo, Alessio
    Proutiere, Alexandre
    [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 4561 - 4567
  • [10] Learning positioning policies for mobile manipulation operations with deep reinforcement learning
    Ander Iriondo
    Elena Lazkano
    Ander Ansuategi
    Andoni Rivera
    Iker Lluvia
    Carlos Tubío
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 3003 - 3023