Efficient spam filtering through intelligent text modification detection using machine learning

Cited by: 0

Authors:

Mageshkumar, N. [1]

Vijayaraj, A. [2]

Arunpriya, N. [3]

Sangeetha, A. [4]

Affiliations:

[1] Madanapalle Inst Technol & Sci, Dept Comp Sci & Technol, Madanapalle 517325, Chittor, India

[2] Deemed be Univ Vadlamudi, Dept Informat Technol, Vignans Fdn Sci Technol & Res, Guntur 522213, Andhra Pradesh, India

[3] Panimalar Engn Coll, Dept Elect Commun & Engn, Chennai 600123, India

[4] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad, India

Source:

MATERIALS TODAY-PROCEEDINGS | 2022 / Volume 64

Keywords:

Bayesian poisoning; Diacritics; Leetspeak; Naive Bayes; Spam filters; Spammer;

DOI:

Not available

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Subject classification code:

08;

Abstract:

Spam emails have long been a source of concern in the field of computer security. They are both monetarily and technologically costly, as well as extremely harmful to computers and networks. Despite the rise of social networks and other Internet-based information exchange venues, email communication has become increasingly important over time, necessitating the urgent improvement of spam filters. Although various spam filters have been developed to help prevent spam emails from reaching a user's mailbox, there has been little research into text modifications. Because of its simplicity and efficiency, Naive Bayes is currently one of the most used methods of spam classification. However, when emails contain leetspeak or diacritics, Naive Bayes is unable to correctly categorize them. As a result, we created a novel method to improve the accuracy of the Naive Bayes Spam Filter to detect text alterations and correctly classify emails as spam or ham in this proposal. When compared to SpamAssassin, our Python approach uses a combination of semantic, keyword, and machine learning algorithms to improve Naive Bayes accuracy. Furthermore, we identified a link between email length and spam score, indicating that Bayesian Poisoning, a contentious concept, is an actual occurrence used by spammers.

Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the International Conference on Advanced Materials for Innovation and Sustainability.
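The abstract's central point, that leetspeak and diacritics hide spam words from a word-based Naive Bayes filter, can be illustrated with a minimal Python sketch. This is not the authors' implementation, which the record does not describe in detail. The names `LEET_MAP`, `normalize`, `tokenize`, and `NaiveBayes` are hypothetical, the leetspeak table is a small assumed sample, and the training messages in the usage note are invented for illustration. The sketch strips diacritics with Unicode decomposition, maps a few leetspeak characters back to letters, and feeds the normalized tokens to a multinomial Naive Bayes classifier with add-one smoothing.

```python
import math
import unicodedata
from collections import Counter

# Hypothetical, deliberately small leetspeak table (an assumption, not from the paper).
LEET_MAP = str.maketrans({
    "0": "o", "1": "i", "3": "e", "4": "a",
    "5": "s", "7": "t", "@": "a", "$": "s",
})


def normalize(text):
    """Strip diacritics, lowercase, then map leetspeak characters to letters."""
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop combining marks (accents) left behind by the decomposition.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower().translate(LEET_MAP)


def tokenize(text):
    """Split normalized text on whitespace."""
    return normalize(text).split()


class NaiveBayes:
    """Multinomial Naive Bayes over normalized tokens, with add-one smoothing."""

    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.doc_counts = Counter()

    def train(self, text, label):
        self.word_counts[label].update(tokenize(text))
        self.doc_counts[label] += 1

    def classify(self, text):
        vocab = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label in ("spam", "ham"):
            total_words = sum(self.word_counts[label].values())
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[label] / total_docs)
            for word in tokenize(text):
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (total_words + len(vocab)))
            scores[label] = score
        return max(scores, key=scores.get)
```

For example, after training on two invented spam messages ("free viagra now", "win free money") and two invented ham messages ("meeting at noon tomorrow", "lunch tomorrow with the team"), the obfuscated input "Fr3e V1ágra n0w" normalizes to "free viagra now" and is scored against the spam vocabulary rather than being treated as three unseen words. A known limitation of this blanket mapping is that genuine digits are rewritten too, so a production filter would need a more selective rule.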

Pages: 848-858

Page count: 11
