Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies

Cited by: 1
Authors
Alshamrani, Adel [1]
Alshahrani, Abdullah [2]
Affiliations
[1] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Jeddah, Saudi Arabia
[2] Univ Jeddah, Coll Comp Sci & Engn, Dept Comp Sci & Artificial Intelligence, Jeddah, Saudi Arabia
Source
INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2023 / Vol. 36 / Issue 03
Keywords
Multiarmed bandits; reinforcement learning; multiagents; intrusion detection systems; COMPLEXITY;
DOI
10.32604/iasc.2023.032835
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems. In this paper, we investigate a problem where multiagent systems sensing and acting in an environment contribute to adaptive cyber defense. We present a learning strategy that enables multiple agents to learn optimal policies using multiagent reinforcement learning (MARL). Our proposed approach is inspired by the multiarmed bandits (MAB) learning technique for multiple agents to cooperate in decision making or to work independently. We study a MAB approach in which defenders visit a system multiple times in an alternating fashion to maximize their rewards and protect their system. We find that this game can be modeled from an individual player's perspective as a restless MAB problem. We discover further results when the MAB takes the form of a pure birth process, such as a myopic optimal policy, as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.
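The alternating-visit idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: the abstract describes a restless MAB, whereas the code below uses a plain stationary epsilon-greedy bandit as a stand-in. The function name `run_alternating_bandit`, the Bernoulli reward model, and the parameters `arm_means`, `epsilon`, and `seed` are all assumptions made for illustration only.

```python
import random


def run_alternating_bandit(arm_means, n_rounds=2000, epsilon=0.1, seed=0):
    """Two defenders take turns pulling arms of a shared bandit.

    Hypothetical setup: each arm stands for a defensive action whose
    reward is Bernoulli with a fixed mean (a stationary simplification,
    not the restless model named in the abstract).
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    # Each defender keeps independent pull counts and mean-reward estimates.
    counts = [[0] * n_arms for _ in range(2)]
    estimates = [[0.0] * n_arms for _ in range(2)]
    totals = [0.0, 0.0]

    for t in range(n_rounds):
        d = t % 2  # defenders alternate: 0, 1, 0, 1, ...
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore a random arm
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[d][a])
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[d][arm] += 1
        # Incremental update of the running mean for the chosen arm.
        estimates[d][arm] += (reward - estimates[d][arm]) / counts[d][arm]
        totals[d] += reward

    return estimates, counts, totals


if __name__ == "__main__":
    est, cnt, tot = run_alternating_bandit([0.2, 0.5, 0.9])
    print("estimates per defender:", est)
    print("pull counts per defender:", cnt)
    print("total reward per defender:", tot)
```

Here the defenders learn independently; a cooperative variant could share the `counts` and `estimates` tables between them, which is one simple way to read the abstract's "cooperate in decision making or work independently".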
Pages: 2757 - 2771
Page count: 15
Related papers
50 in total
  • [31] Cooperative channel assignment for VANETs based on multiagent reinforcement learning
    Wang, Yun-peng
    Zheng, Kun-xian
    Tian, Da-xin
    Duan, Xu-ting
    Zhou, Jian-shan
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (07) : 1047 - 1058
  • [32] Adaptive Incentive for Cross-Silo Federated Learning in IIoT: A Multiagent Reinforcement Learning Approach
    Yuan, Shijing
    Dong, Beiyu
    Lv, Hongtao
    Liu, Hongze
    Chen, Hongyang
    Wu, Chentao
    Guo, Song
    Ding, Yue
    Li, Jie
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (09) : 15048 - 15058
  • [33] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [34] Learning to Teach in Cooperative Multiagent Reinforcement Learning
    Omidshafiei, Shayegan
    Kim, Dong-Ki
    Liu, Miao
    Tesauro, Gerald
    Riemer, Matthew
    Amato, Christopher
    Campbell, Murray
    How, Jonathan P.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019 : 6128 - 6136
  • [35] Gradient based method for symmetric and asymmetric multiagent reinforcement learning
    Könönen, V
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 68 - 75
  • [36] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
    Devlin, Sam
    Yliniemi, Logan
    Kudenko, Daniel
    Tumer, Kagan
    AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014 : 165 - 172
  • [37] Exponential moving average based multiagent reinforcement learning algorithms
    Awheda, Mostafa D.
    Schwartz, Howard M.
    Artificial Intelligence Review, 2016, 45 : 299 - 332
  • [38] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
    Phon-Amnuaisuk, Somnuk
    NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579
  • [39] Voting-Based Multiagent Reinforcement Learning for Intelligent IoT
    Xu, Yue
    Deng, Zengde
    Wang, Mengdi
    Xu, Wenjun
    So, Anthony Man-Cho
    Cui, Shuguang
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 2681 - 2693
  • [40] Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous
    Wang, Rose E.
    Kew, J. Chase
    Lee, Dennis
    Lee, Tsang-Wei Edward
    Zhang, Tingnan
    Ichter, Brian
    Tan, Jie
    Faust, Aleksandra
    CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 711 - 725