Efficient spam filtering through intelligent text modification detection using machine learning

被引:0
|
作者
Mageshkumar, N. [1 ]
Vijayaraj, A. [2 ]
Arunpriya, N. [3 ]
Sangeetha, A. [4 ]
机构
[1] Madanapalle Inst Technol & Sci, Dept Comp Sci & Technol, Madanapalle 517325, Chittor, India
[2] Deemed be Univ Vadlamudi, Dept Informat Technol, Vignans Fdn Sci Technol & Res, Guntur 522213, Andhra Pradesh, India
[3] Panimalar Engn Coll, Dept Elect Commun & Engn, Chennai 600123, India
[4] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad, India
关键词
Bayesian poisoning; Diacritics; Leetspeak; Naive Bayes; Spam filters; Spammer;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Spam emails have long been a source of concern in the field of computer security. They are both monetarily and technologically costly, as well as extremely harmful to computers and networks. Despite the rise of social networks and other Internet-based information exchange venues, email commu-nication has become increasingly important over time, necessitating the urgent improvement of spam fil-ters. Although various spam filters have been developed to help prevent spam emails from reaching a user's mailbox, there has been little research into text modifications. Because of its simplicity and effi-ciency, Naive Bayes is currently one of the most used methods of spam classification. However, when emails contain leetspeak or diacritics, Naive Bayes is unable to correctly categorize them. As a result, we created a novel method to improve the accuracy of the Naive Bayes Spam Filter to detect text alter-ations and correctly classify emails as Spam or ham in this proposal. When compared to Spamassassin, our Python approach uses a combination of semantic, keyword, and machine learning algorithms to improve Naive Bayes accuracy. Furthermore, we identified a link between email length and spam score, indicating that Bayesian Poisoning, a contentious concept, is an actual occurrence used by spammers.Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the International Confer-ence on Advanced Materials for Innovation and Sustainability.
引用
收藏
页码:848 / 858
页数:11
相关论文
共 50 条
  • [41] HSB-SPAM: An Efficient Image Filtering Detection Technique
    Agarwal, Saurabh
    Jung, Ki-Hyun
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [42] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
    Liu, Wuying
    Wang, Ting
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2012, 5 (03) : 505 - 518
  • [43] Utilizing Multi-Field Text Features for Efficient Email Spam Filtering
    Wuying Liu
    Ting Wang
    [J]. International Journal of Computational Intelligence Systems, 2012, 5 : 505 - 518
  • [44] Detection of emotion by text analysis using machine learning
    Machova, Kristina
    Szaboova, Martina
    Paralic, Jan
    Micko, Jan
    [J]. FRONTIERS IN PSYCHOLOGY, 2023, 14
  • [45] Intelligent phishing website detection using machine learning
    Jha, Ashish Kumar
    Muthalagu, Raja
    Pawar, Pranav M.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29431 - 29456
  • [46] Intelligent Flower Detection System Using Machine Learning
    Safar, Amna
    Safar, Maytham
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, 2020, 1038 : 463 - 472
  • [47] Intelligent phishing website detection using machine learning
    Ashish Kumar Jha
    Raja Muthalagu
    Pranav M. Pawar
    [J]. Multimedia Tools and Applications, 2023, 82 : 29431 - 29456
  • [48] SMS Spam Filtering on Multiple Background Datasets Using Machine Learning Techniques: A Novel Approach
    Kaliyar, Rohit Kumar
    Narang, Pratik
    Goswami, Anurag
    [J]. PROCEEDINGS OF THE 2018 IEEE 8TH INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC 2018), 2018, : 59 - 65
  • [49] Applicability of machine learning in spam and phishing email filtering: review and approaches
    Tushaar Gangavarapu
    C. D. Jaidhar
    Bhabesh Chanduka
    [J]. Artificial Intelligence Review, 2020, 53 : 5019 - 5081
  • [50] A Comparative Study of Machine Learning Techniques in Blog Comments Spam Filtering
    Romero, C.
    Valdez, M. Garcia
    Alanis, A.
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,