A Sampling-Based Method for Detecting Data Poisoning Attacks in Recommendation Systems

Cited by: 0
Authors
Li, Mohan [1]
Lian, Yuxin [1]
Zhu, Jinpeng [1]
Lin, Jingyi [1]
Wan, Jiawen [1]
Sun, Yanbin [1]
Affiliations
[1] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
Keywords
data poisoning; recommendation systems; ensemble learning; data poisoning detection; FRAMEWORK;
DOI
10.3390/math12020247
Chinese Library Classification (CLC) number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
The recommendation algorithm based on collaborative filtering is vulnerable to data poisoning attacks, wherein attackers can manipulate system output by injecting a large volume of fake rating data. To address this issue, it is essential to investigate methods for detecting systematically injected poisoning data within the rating matrix. Since attackers often inject a significant quantity of poisoning data in a short period to achieve their desired impact, these data may exhibit spatial proximity. In other words, poisoning data may be concentrated in adjacent rows of the rating matrix. This paper capitalizes on the proximity characteristics of poisoning data in the rating matrix and introduces a sampling-based method for detecting data poisoning attacks. First, we designed a rating matrix sampling method specifically for detecting poisoning data. By sampling differences obtained from the original rating matrix, it is possible to infer the presence of poisoning attacks and effectively discard poisoning data. Second, we developed a method for pinpointing malicious data based on the distance of rating vectors. Through distance calculations, we can accurately identify the positions of malicious data. After that, we validated the method on three real-world datasets. The results demonstrate the effectiveness of our method in identifying malicious data within the rating matrix.
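The abstract does not give the details of the sampling scheme or the distance measure, so the following is only a minimal sketch of the proximity idea, not the authors' algorithm. It assumes that fake profiles occupy adjacent rows, that leaving out one contiguous block of rows at a time is an acceptable stand-in for the paper's sampling step, and that Euclidean distance to the centroid of the remaining rows is an acceptable stand-in for its rating-vector distance. The function names, the block size, the `n_std` threshold, and the toy data are all hypothetical.

```python
import numpy as np


def most_suspicious_block(ratings, block_size):
    """Return the start row of the contiguous block whose removal
    shifts the per-item mean ratings the most (mean absolute shift)."""
    n_rows = ratings.shape[0]
    full_mean = ratings.mean(axis=0)
    best_start, best_shift = 0, -1.0
    for start in range(0, n_rows, block_size):
        keep = np.ones(n_rows, dtype=bool)
        keep[start:start + block_size] = False
        shift = np.abs(ratings[keep].mean(axis=0) - full_mean).mean()
        if shift > best_shift:
            best_start, best_shift = start, shift
    return best_start


def flag_rows_by_distance(ratings, start, block_size, n_std=2.0):
    """Flag rows inside the suspect block whose Euclidean distance to the
    centroid of all other rows exceeds mean + n_std * std of the
    distances measured on those other rows."""
    inside = np.zeros(ratings.shape[0], dtype=bool)
    inside[start:start + block_size] = True
    centroid = ratings[~inside].mean(axis=0)
    dist = np.linalg.norm(ratings - centroid, axis=1)
    threshold = dist[~inside].mean() + n_std * dist[~inside].std()
    return [int(i) for i in np.flatnonzero(inside & (dist > threshold))]


# Toy data (assumption): 90 genuine users with random ratings in 1..5,
# plus 10 fake users injected into adjacent rows 50..59.
rng = np.random.default_rng(0)
ratings = rng.integers(1, 6, size=(100, 20)).astype(float)
ratings[50:60, :] = 1.0  # fake profiles give every other item the lowest rating
ratings[50:60, 0] = 5.0  # and push target item 0 to the top rating

start = most_suspicious_block(ratings, block_size=10)
flagged = flag_rows_by_distance(ratings, start, block_size=10)
print(start, flagged)
```

In this toy setting the injected block is aligned with the block grid, which makes localization easy; an attack that straddles two blocks or mimics genuine rating distributions would need overlapping samples and a more careful threshold.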
Pages: 13