Clustering-based attack detection for adversarial reinforcement learning

被引:0
|
作者
Majadas, Ruben [1 ]
Garcia, Javier [2 ]
Fernandez, Fernando [1 ]
机构
[1] Univ Carlos III Madrid, Dept Informat, Ave Univ 30, Madrid 28911, Spain
[2] Univ Santiago De Compostela, Rua Lope Gomez De Marzoa S-N, Santiago De Compostela 15782, Spain
关键词
Adversarial reinforcement learning; Adversarial attacks; Change-point detection; Clustering applications; MODEL;
D O I
10.1007/s10489-024-05275-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting malicious attacks presents a major challenge in the field of reinforcement learning (RL), as such attacks can force the victim to perform abnormal actions, with potentially severe consequences. To mitigate these risks, current research focuses on the enhancement of RL algorithms with efficient detection mechanisms, especially for real-world applications. Adversarial attacks have the potential to alter the environmental dynamics of a Markov Decision Process (MDP) perceived by an RL agent. Leveraging these changes in dynamics, we propose a novel approach to detect attacks. Our contribution can be summarized in two main aspects. Firstly, we propose a novel formalization of the attack detection problem that entails analyzing modifications made by attacks to the transition and reward dynamics within the environment. This problem can be framed as a context change detection problem, where the goal is to identify the transition from a "free-of-attack" situation to an "under-attack" scenario. To solve this problem, we propose a groundbreaking "model-free" clustering-based countermeasure. This approach consists of two essential steps: first, partitioning the transition space into clusters, and then using this partitioning to identify changes in environmental dynamics caused by adversarial attacks. To assess the efficiency of our detection method, we performed experiments on four established RL domains (grid-world, mountain car, carpole, and acrobot) and subjected them to four advanced attack types. Uniform, Strategically-timed, Q-value, and Multi-objective. Our study proves that our technique has a high potential for perturbation detection, even in scenarios where attackers employ more sophisticated strategies.
引用
收藏
页码:2631 / 2647
页数:17
相关论文
共 50 条
  • [31] Implementation of a Clustering-Based LDDoS Detection Method
    Hussain, Tariq
    Saeed, Muhammad Irfan
    Khan, Irfan Ullah
    Aslam, Nida
    Aljameel, Sumayh S.
    ELECTRONICS, 2022, 11 (18)
  • [32] Fuzzy Clustering-Based Approach for Outlier Detection
    Al-Zoubi, Moh'd Belal
    Ali, Al-Dahoud
    Yahya, Abdelfatah A.
    RECENT ADVANCES AND APPLICATIONS OF COMPUTER ENGINEERING: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE (ACE 10), 2010, : 192 - +
  • [33] Spam Detection Using Clustering-Based SVM
    Pandya, Darshit
    PROCEEDINGS OF THE 2019 2ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND MACHINE INTELLIGENCE (MLMI 2019), 2019, : 12 - 15
  • [34] Clustering-Based Network Intrusion Detection System
    Fan, Chun-I
    Lai, Yen-Lin
    Shie, Cheng-Han
    2022 5TH IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (IEEE DSC 2022), 2022,
  • [35] Clustering-Based Discriminant Analysis for Eye Detection
    Chen, Shuo
    Liu, Chengjun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (04) : 1629 - 1638
  • [36] Highly transferable adversarial attack against deep-reinforcement-learning-based frequency control
    Li, Zhongwei
    Liu, Yang
    Qiu, Peng
    Yin, Hongyan
    Wan, Xu
    Sun, Mingyang
    Energy Conversion and Economics, 2023, 4 (03): : 202 - 212
  • [37] Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition Models
    Wang, Zeyuan
    Sha, Chaofeng
    Yang, Su
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3162 - 3168
  • [38] SSQLi: A Black-Box Adversarial Attack Method for SQL Injection Based on Reinforcement Learning
    Guan, Yuting
    He, Junjiang
    Li, Tao
    Zhao, Hui
    Ma, Baoqiang
    FUTURE INTERNET, 2023, 15 (04):
  • [39] Data clustering-based fault detection in WSNs
    Yang, Yang
    Liu, Qian
    Gao, Zhipeng
    Qiu, Xuesong
    Rui, Lanlan
    2015 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2015, : 334 - 339
  • [40] Destabilizing Attack and Robust Defense for Inverter-Based Microgrids by Adversarial Deep Reinforcement Learning
    Wang, Yu
    Pal, Bikash C.
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (06) : 4839 - 4850