Label flipping attacks against Naive Bayes on spam filtering systems

被引:0
|
作者
Hongpo Zhang
Ning Cheng
Yang Zhang
Zhanbo Li
机构
[1] Zhengzhou University,School of Information Engineering
[2] Zhengzhou University,Cooperative Innovation Center of Internet Healthcare
[3] Zhengzhou University,School of Software
来源
Applied Intelligence | 2021年 / 51卷
关键词
Spam classification; Label flipping attacks; Naive Bayes classifier; Performance evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
Label flipping attack is a poisoning attack that flips the labels of training samples to reduce the classification performance of the model. Robustness is used to measure the applicability of machine learning algorithms to adversarial attack. Naive Bayes (NB) algorithm is a anti-noise and robust machine learning technique. It shows good robustness when dealing with issues such as document classification and spam filtering. Here we propose two novel label flipping attacks to evaluate the robustness of NB under label noise. For the three datasets of Spambase, TREC 2006c and TREC 2007 in the spam classification domain, our attack goal is to increase the false negative rate of NB under the influence of label noise without affecting normal mail classification. Our evaluation shows that at a noise level of 20%, the false negative rate of Spambase and TREC 2006c has increased by about 20%, and the test error of the TREC 2007 dataset has increased to nearly 30%. We compared the classification accuracy of five classic machine learning algorithms (random forest(RF), support vector machine(SVM), decision tree(DT), logistic regression(LR), and NB) and two deep learning models(AlexNet, LeNet) under the proposed label flipping attacks. The experimental results show that two label noises are suitable for various classification models and effectively reduce the accuracy of the models.
引用
收藏
页码:4503 / 4514
页数:11
相关论文
共 50 条
  • [1] Label flipping attacks against Naive Bayes on spam filtering systems
    Zhang, Hongpo
    Cheng, Ning
    Zhang, Yang
    Li, Zhanbo
    [J]. APPLIED INTELLIGENCE, 2021, 51 (07) : 4503 - 4514
  • [2] Understanding of the Naive Bayes Classifier in Spam Filtering
    Wei, Qijia
    [J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2018), 2018, 1967
  • [3] Spam Filtering:Online Naive Bayes Based on TONE
    Guanglu Sun
    Hongyue Sun
    Yingcai Ma
    Yuewu Shen
    [J]. ZTE Communications, 2013, 11 (02) : 51 - 54
  • [4] On defending against label flipping attacks on malware detection systems
    Taheri, Rahim
    Javidan, Reza
    Shojafar, Mohammad
    Pooranian, Zahra
    Miri, Ali
    Conti, Mauro
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (18): : 14781 - 14800
  • [5] On defending against label flipping attacks on malware detection systems
    Rahim Taheri
    Reza Javidan
    Mohammad Shojafar
    Zahra Pooranian
    Ali Miri
    Mauro Conti
    [J]. Neural Computing and Applications, 2020, 32 : 14781 - 14800
  • [6] Spam Filtering using Association Rules and Naive Bayes Classifier
    Yang, Tianda
    Qian, Kai
    Lo, Dan Chia-Tien
    Al Nasr, Kamal
    Qian, Ying
    [J]. PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 638 - 642
  • [7] Web Service-enabled Spam Filtering with Naive Bayes Classification
    You, Wanqing
    Qian, Kai
    Lo, Dan
    Bhattacharya, Prahir
    Guo, Minzhe
    Qian, Ying
    [J]. 2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 99 - 104
  • [8] Word Embedding based Multinomial Naive Bayes Algorithm for Spam Filtering
    Kadam, Sumedh
    Gala, Aayush
    Gehlot, Pritesh
    Kurup, Aditya
    Ghag, Kranti
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [9] A Support Vector Machine based Naive Bayes Algorithm for Spam Filtering
    Feng, Weimiao
    Sun, Jianguo
    Zhang, Liguo
    Cao, Cuiling
    Yang, Qing
    [J]. 2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
  • [10] REVISED NAIVE BAYES CL ASSIFIER FOR COMBATING THE FOCUS ATTACK IN SPAM FILTERING
    Peng, Junyan
    Chan, Patrick P. K.
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 610 - 614