Clustering-based attack detection for adversarial reinforcement learning

被引:0
|
作者
Majadas, Ruben [1 ]
Garcia, Javier [2 ]
Fernandez, Fernando [1 ]
机构
[1] Univ Carlos III Madrid, Dept Informat, Ave Univ 30, Madrid 28911, Spain
[2] Univ Santiago De Compostela, Rua Lope Gomez De Marzoa S-N, Santiago De Compostela 15782, Spain
关键词
Adversarial reinforcement learning; Adversarial attacks; Change-point detection; Clustering applications; MODEL;
D O I
10.1007/s10489-024-05275-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting malicious attacks presents a major challenge in the field of reinforcement learning (RL), as such attacks can force the victim to perform abnormal actions, with potentially severe consequences. To mitigate these risks, current research focuses on the enhancement of RL algorithms with efficient detection mechanisms, especially for real-world applications. Adversarial attacks have the potential to alter the environmental dynamics of a Markov Decision Process (MDP) perceived by an RL agent. Leveraging these changes in dynamics, we propose a novel approach to detect attacks. Our contribution can be summarized in two main aspects. Firstly, we propose a novel formalization of the attack detection problem that entails analyzing modifications made by attacks to the transition and reward dynamics within the environment. This problem can be framed as a context change detection problem, where the goal is to identify the transition from a "free-of-attack" situation to an "under-attack" scenario. To solve this problem, we propose a groundbreaking "model-free" clustering-based countermeasure. This approach consists of two essential steps: first, partitioning the transition space into clusters, and then using this partitioning to identify changes in environmental dynamics caused by adversarial attacks. To assess the efficiency of our detection method, we performed experiments on four established RL domains (grid-world, mountain car, carpole, and acrobot) and subjected them to four advanced attack types. Uniform, Strategically-timed, Q-value, and Multi-objective. Our study proves that our technique has a high potential for perturbation detection, even in scenarios where attackers employ more sophisticated strategies.
引用
收藏
页码:2631 / 2647
页数:17
相关论文
共 50 条
  • [21] Multiple-Model Based Defense for Deep Reinforcement Learning Against Adversarial Attack
    Chan, Patrick P. K.
    Wang, Yaxuan
    Kees, Natasha
    Yeung, Daniel S.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT I, 2021, 12891 : 42 - 53
  • [22] Incremental Adversarial Learning for Polymorphic Attack Detection
    Sabeel, Ulya
    Heydari, Shahram Shah
    El-Khatib, Khalil
    Elgazzar, Khalid
    IEEE Transactions on Machine Learning in Communications and Networking, 2024, 2 : 869 - 887
  • [23] Comparing Metaheuristic Search Techniques in Addressing the Effectiveness of Clustering-Based DDoS Attack Detection Methods
    Zeinalpour, Alireza
    McElroy, Charles P.
    ELECTRONICS, 2024, 13 (05)
  • [24] Adversarial robustness of deep reinforcement learning-based intrusion detection
    Merzouk, Mohamed Amine
    Neal, Christopher
    Delas, Josephine
    Yaich, Reda
    Boulahia-Cuppens, Nora
    Cuppens, Frederic
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, : 3625 - 3651
  • [25] Learning adversarial attack policies through multi-objective reinforcement learning
    Garcia, Javier
    Majadas, Ruben
    Fernandez, Fernando
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
  • [26] An Adversarial Reinforcement Learning Framework for Robust Machine Learning-based Malware Detection
    Ebrahimi, Mohammadreza
    Li, Weifeng
    Chai, Yidong
    Pacheco, Jason
    Chen, Hsinchun
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 567 - 576
  • [27] Clustering-based initialization of Learning Classifier Systems
    Tzima, Fani A.
    Mitkas, Pericles A.
    Theocharis, John B.
    SOFT COMPUTING, 2012, 16 (07) : 1267 - 1286
  • [28] Clustering-based Domain-Incremental Learning
    Lamers, Christiaan
    Vidal, Rene
    Belbachir, Nabil
    Van Stein, Niki
    Back, Thomas
    Giampouras, Paris
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3376 - 3384
  • [29] CLUSTERING-BASED FEATURE LEARNING ON VARIABLE STARS
    Mackenzie, Cristobal
    Pichara, Karim
    Protopapas, Pavlos
    ASTROPHYSICAL JOURNAL, 2016, 820 (02):
  • [30] Clustering-based Anomaly Detection for Smartphone Applications
    El Attar, Ali
    Khatoun, Rida
    Lemercier, Marc
    2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,