Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies

被引：1

作者：

Alshamrani, Adel ^{[1
]}

Alshahrani, Abdullah ^{[2
]}

机构：

[1] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Jeddah, Saudi Arabia

[2] Univ Jeddah, Coll Comp Sci & Engn, Dept Comp Sci & Artificial Intelligence, Jeddah, Saudi Arabia

来源：

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2023年 / 36卷 / 03期

关键词：

Multiarmed bandits; reinforcement learning; multiagents; intrusion detection systems; COMPLEXITY;

D O I：

10.32604/iasc.2023.032835

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems. In this paper, we investigate a problem where multiagent sys-tems sensing and acting in an environment contribute to adaptive cyber defense. We present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning (MARL). Our proposed approach is inspired by the multiarmed bandits (MAB) learning technique for multiple agents to cooperate in decision making or to work independently. We study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their system. We find that this game can be modeled from an individual player's perspective as a restless MAB problem. We discover further results when the MAB takes the form of a pure birth process, such as a myopic optimal policy, as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.

引用

页码：2757 / 2771

页数：15

共 50 条

[31] Cooperative channel assignment for VANETs based on multiagent reinforcement learning
Wang, Yun-peng
Zheng, Kun-xian
Tian, Da-xin
Duan, Xu-ting
Zhou, Jian-shan
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (07) : 1047 - 1058
[32] Adaptive Incentive for Cross-Silo Federated Learning in IIoT: A Multiagent Reinforcement Learning Approach
Yuan, Shijing
Dong, Beiyu
Lv, Hongtao
Liu, Hongze
Chen, Hongyang
Wu, Chentao
Guo, Song
Ding, Yue
Li, Jie
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (09): : 15048 - 15058
[33] Lateral Transfer Learning for Multiagent Reinforcement Learning
Shi, Haobin
Li, Jingchen
Mao, Jiahui
Hwang, Kao-Shing
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
[34] Learning to Teach in Cooperative Multiagent Reinforcement Learning
Omidshafiei, Shayegan
Kim, Dong-Ki
Liu, Miao
Tesauro, Gerald
Riemer, Matthew
Amato, Christopher
Campbell, Murray
How, Jonathan P.
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
[35] Gradient based method for symmetric and asymmetric multiagent reinforcement learning
Könönen, V
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 68 - 75
[36] Potential-Based Difference Rewards for Multiagent Reinforcement Learning
Devlin, Sam
Yliniemi, Logan
Kudenko, Daniel
Tumer, Kagan
AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 165 - 172
[37] Exponential moving average based multiagent reinforcement learning algorithms
Mostafa D. Awheda
Howard M. Schwartz
Artificial Intelligence Review, 2016, 45 : 299 - 332
[38] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
Phon-Amnuaisuk, Somnuk
NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579
[39] Voting-Based Multiagent Reinforcement Learning for Intelligent IoT
Xu, Yue
Deng, Zengde
Wang, Mengdi
Xu, Wenjun
So, Anthony Man-Cho
Cui, Shuguang
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (04) : 2681 - 2693
[40] Model-based Reinforcement Learning for Decentralized Multiagent Rendezvous
Wang, Rose E.
Kew, J. Chase
Lee, Dennis
Lee, Tsang-Wei Edward
Zhang, Tingnan
Ichter, Brian
Tan, Jie
Faust, Aleksandra
CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 711 - 725

← 1 2 3 4 5 →