Efficient spam filtering through intelligent text modification detection using machine learning

被引：0

作者：

Mageshkumar, N. ^{[1
]}

Vijayaraj, A. ^{[2
]}

Arunpriya, N. ^{[3
]}

Sangeetha, A. ^{[4
]}

机构：

[1] Madanapalle Inst Technol & Sci, Dept Comp Sci & Technol, Madanapalle 517325, Chittor, India

[2] Deemed be Univ Vadlamudi, Dept Informat Technol, Vignans Fdn Sci Technol & Res, Guntur 522213, Andhra Pradesh, India

[3] Panimalar Engn Coll, Dept Elect Commun & Engn, Chennai 600123, India

[4] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad, India

来源：

MATERIALS TODAY-PROCEEDINGS | 2022年 / 64卷

关键词：

Bayesian poisoning; Diacritics; Leetspeak; Naive Bayes; Spam filters; Spammer;

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Spam emails have long been a source of concern in the field of computer security. They are both monetarily and technologically costly, as well as extremely harmful to computers and networks. Despite the rise of social networks and other Internet-based information exchange venues, email commu-nication has become increasingly important over time, necessitating the urgent improvement of spam fil-ters. Although various spam filters have been developed to help prevent spam emails from reaching a user's mailbox, there has been little research into text modifications. Because of its simplicity and effi-ciency, Naive Bayes is currently one of the most used methods of spam classification. However, when emails contain leetspeak or diacritics, Naive Bayes is unable to correctly categorize them. As a result, we created a novel method to improve the accuracy of the Naive Bayes Spam Filter to detect text alter-ations and correctly classify emails as Spam or ham in this proposal. When compared to Spamassassin, our Python approach uses a combination of semantic, keyword, and machine learning algorithms to improve Naive Bayes accuracy. Furthermore, we identified a link between email length and spam score, indicating that Bayesian Poisoning, a contentious concept, is an actual occurrence used by spammers.Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the International Confer-ence on Advanced Materials for Innovation and Sustainability.

引用

页码：848 / 858

页数：11

共 50 条

[41] HSB-SPAM: An Efficient Image Filtering Detection Technique
Agarwal, Saurabh
Jung, Ki-Hyun
[J]. APPLIED SCIENCES-BASEL, 2021, 11 (09):
[42] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
Liu, Wuying
Wang, Ting
[J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2012, 5 (03) : 505 - 518
[43] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
Wuying Liu
Ting Wang
[J]. International Journal of Computational Intelligence Systems, 2012, 5 : 505 - 518
[44] Detection of emotion by text analysis using machine learning
Machova, Kristina
Szaboova, Martina
Paralic, Jan
Micko, Jan
[J]. FRONTIERS IN PSYCHOLOGY, 2023, 14
[45] Intelligent phishing website detection using machine learning
Jha, Ashish Kumar
Muthalagu, Raja
Pawar, Pranav M.
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29431 - 29456
[46] Intelligent Flower Detection System Using Machine Learning
Safar, Amna
Safar, Maytham
[J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2020, 1038 : 463 - 472
[47] Intelligent phishing website detection using machine learning
Ashish Kumar Jha
Raja Muthalagu
Pranav M. Pawar
[J]. Multimedia Tools and Applications, 2023, 82 : 29431 - 29456
[48] SMS Spam Filtering on Multiple Background Datasets Using Machine Learning Techniques: A Novel Approach
Kaliyar, Rohit Kumar
Narang, Pratik
Goswami, Anurag
[J]. PROCEEDINGS OF THE 2018 IEEE 8TH INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC 2018), 2018, : 59 - 65
[49] Applicability of machine learning in spam and phishing email filtering: review and approaches
Tushaar Gangavarapu
C. D. Jaidhar
Bhabesh Chanduka
[J]. Artificial Intelligence Review, 2020, 53 : 5019 - 5081
[50] A Comparative Study of Machine Learning Techniques in Blog Comments Spam Filtering
Romero, C.
Valdez, M. Garcia
Alanis, A.
[J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,

← 1 2 3 4 5 →