Label flipping attacks against Naive Bayes on spam filtering systems

Cited by: 0

Authors:

Hongpo Zhang

Ning Cheng

Yang Zhang

Zhanbo Li

Affiliations:

[1] Zhengzhou University, School of Information Engineering

[2] Zhengzhou University, Cooperative Innovation Center of Internet Healthcare

[3] Zhengzhou University, School of Software

Source:

Applied Intelligence | 2021 / Vol. 51

Keywords:

Spam classification; Label flipping attacks; Naive Bayes classifier; Performance evaluation

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

A label flipping attack is a poisoning attack that flips the labels of training samples to reduce the classification performance of a model. Robustness measures how well a machine learning algorithm holds up under adversarial attack. The Naive Bayes (NB) algorithm is a noise-tolerant, robust machine learning technique that performs well on tasks such as document classification and spam filtering. Here we propose two novel label flipping attacks to evaluate the robustness of NB under label noise. On three datasets from the spam classification domain (Spambase, TREC 2006c, and TREC 2007), our attack goal is to increase the false negative rate of NB under label noise without affecting the classification of normal mail. Our evaluation shows that at a noise level of 20%, the false negative rate on Spambase and TREC 2006c increases by about 20%, and the test error on the TREC 2007 dataset rises to nearly 30%. We also compare the classification accuracy of five classic machine learning algorithms (random forest (RF), support vector machine (SVM), decision tree (DT), logistic regression (LR), and NB) and two deep learning models (AlexNet, LeNet) under the proposed label flipping attacks. The experimental results show that the two proposed label noises apply to a range of classification models and effectively reduce their accuracy.
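
The abstract does not describe how the two proposed attacks choose which samples to flip, so the following is only a minimal sketch of a generic random label flipping step and of the false negative rate it targets, not the authors' method. The names `flip_spam_labels` and `false_negative_rate`, the 1 = spam / 0 = ham encoding, and the reading of "noise level" as a fraction of the whole training set are all assumptions made for illustration.

```python
import random

SPAM, HAM = 1, 0  # assumed label encoding: spam is the positive class


def flip_spam_labels(labels, noise_level, seed=0):
    """Return a copy of `labels` with some spam labels flipped to ham.

    `noise_level` is interpreted here (an assumption) as the fraction of
    the whole training set whose labels are flipped. Spam samples are
    picked uniformly at random; ham labels are never touched.
    """
    rng = random.Random(seed)
    spam_indices = [i for i, y in enumerate(labels) if y == SPAM]
    n_flip = min(round(noise_level * len(labels)), len(spam_indices))
    poisoned = list(labels)
    for i in rng.sample(spam_indices, n_flip):
        poisoned[i] = HAM
    return poisoned


def false_negative_rate(y_true, y_pred):
    """Fraction of true spam messages that were predicted as ham."""
    spam_preds = [p for t, p in zip(y_true, y_pred) if t == SPAM]
    if not spam_preds:
        return 0.0
    missed = sum(1 for p in spam_preds if p == HAM)
    return missed / len(spam_preds)


# Hypothetical toy data: 50 spam and 50 ham training labels.
train_labels = [SPAM] * 50 + [HAM] * 50
poisoned_labels = flip_spam_labels(train_labels, noise_level=0.2)
```

Flipping only spam labels to ham biases a classifier trained on the poisoned labels toward the ham class, which is one plausible way to raise the false negative rate while leaving the labels of normal mail untouched.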

Citation

Pages: 4503-4514

Page count: 11

50 items in total

[1] Label flipping attacks against Naive Bayes on spam filtering systems
Zhang, Hongpo
Cheng, Ning
Zhang, Yang
Li, Zhanbo
[J]. APPLIED INTELLIGENCE, 2021, 51 (07) : 4503 - 4514
[2] Understanding of the Naive Bayes Classifier in Spam Filtering
Wei, Qijia
[J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2018), 2018, 1967
[3] Spam Filtering: Online Naive Bayes Based on TONE
Guanglu Sun
Hongyue Sun
Yingcai Ma
Yuewu Shen
[J]. ZTE Communications, 2013, 11 (02) : 51 - 54
[4] On defending against label flipping attacks on malware detection systems
Taheri, Rahim
Javidan, Reza
Shojafar, Mohammad
Pooranian, Zahra
Miri, Ali
Conti, Mauro
[J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (18) : 14781 - 14800
[5] On defending against label flipping attacks on malware detection systems
Rahim Taheri
Reza Javidan
Mohammad Shojafar
Zahra Pooranian
Ali Miri
Mauro Conti
[J]. Neural Computing and Applications, 2020, 32 : 14781 - 14800
[6] Spam Filtering using Association Rules and Naive Bayes Classifier
Yang, Tianda
Qian, Kai
Lo, Dan Chia-Tien
Al Nasr, Kamal
Qian, Ying
[J]. PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (IEEE PIC), 2015 : 638 - 642
[7] Web Service-enabled Spam Filtering with Naive Bayes Classification
You, Wanqing
Qian, Kai
Lo, Dan
Bhattacharya, Prahir
Guo, Minzhe
Qian, Ying
[J]. 2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015 : 99 - 104
[8] Word Embedding based Multinomial Naive Bayes Algorithm for Spam Filtering
Kadam, Sumedh
Gala, Aayush
Gehlot, Pritesh
Kurup, Aditya
Ghag, Kranti
[J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
[9] A Support Vector Machine based Naive Bayes Algorithm for Spam Filtering
Feng, Weimiao
Sun, Jianguo
Zhang, Liguo
Cao, Cuiling
Yang, Qing
[J]. 2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
[10] REVISED NAIVE BAYES CLASSIFIER FOR COMBATING THE FOCUS ATTACK IN SPAM FILTERING
Peng, Junyan
Chan, Patrick P. K.
[J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013 : 610 - 614
