Efficient spam filtering through intelligent text modification detection using machine learning

被引:0
|
作者
Mageshkumar, N. [1 ]
Vijayaraj, A. [2 ]
Arunpriya, N. [3 ]
Sangeetha, A. [4 ]
机构
[1] Madanapalle Inst Technol & Sci, Dept Comp Sci & Technol, Madanapalle 517325, Chittor, India
[2] Deemed be Univ Vadlamudi, Dept Informat Technol, Vignans Fdn Sci Technol & Res, Guntur 522213, Andhra Pradesh, India
[3] Panimalar Engn Coll, Dept Elect Commun & Engn, Chennai 600123, India
[4] MLR Inst Technol, Dept Comp Sci & Engn, Hyderabad, India
关键词
Bayesian poisoning; Diacritics; Leetspeak; Naive Bayes; Spam filters; Spammer;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Spam emails have long been a source of concern in the field of computer security. They are both monetarily and technologically costly, as well as extremely harmful to computers and networks. Despite the rise of social networks and other Internet-based information exchange venues, email commu-nication has become increasingly important over time, necessitating the urgent improvement of spam fil-ters. Although various spam filters have been developed to help prevent spam emails from reaching a user's mailbox, there has been little research into text modifications. Because of its simplicity and effi-ciency, Naive Bayes is currently one of the most used methods of spam classification. However, when emails contain leetspeak or diacritics, Naive Bayes is unable to correctly categorize them. As a result, we created a novel method to improve the accuracy of the Naive Bayes Spam Filter to detect text alter-ations and correctly classify emails as Spam or ham in this proposal. When compared to Spamassassin, our Python approach uses a combination of semantic, keyword, and machine learning algorithms to improve Naive Bayes accuracy. Furthermore, we identified a link between email length and spam score, indicating that Bayesian Poisoning, a contentious concept, is an actual occurrence used by spammers.Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the International Confer-ence on Advanced Materials for Innovation and Sustainability.
引用
收藏
页码:848 / 858
页数:11
相关论文
共 50 条
  • [1] Efficient spam filtering through intelligent text modification detection using machine learning
    Mageshkumar, N.
    Vijayaraj, A.
    Arunpriya, N.
    Sangeetha, A.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 848 - 858
  • [2] Enhancing the Naive Bayes Spam Filter through Intelligent Text Modification Detection
    Huang, Linda
    Jia, Julia
    Ingram, Emma
    Peng, Wuxu
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 849 - 854
  • [3] An Efficient Spam Detection Technique for IoT Devices Using Machine Learning
    Makkar, Aaisha
    Garg, Sahil
    Kumar, Neeraj
    Hossain, M. Shamim
    Ghoneim, Ahmed
    Alrashoud, Mubarak
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (02) : 903 - 912
  • [4] Learning Semantic Coherence for Machine Generated Spam Text Detection
    Bao, Mengjiao
    Li, Jianxin
    Zhang, Jian
    Peng, Hao
    Liu, Xudong
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [5] Spam SMS filtering based on text features and supervised machine learning techniques
    Muhammad Adeel Abid
    Saleem Ullah
    Muhammad Abubakar Siddique
    Muhammad Faheem Mushtaq
    Wajdi Aljedaani
    Furqan Rustam
    [J]. Multimedia Tools and Applications, 2022, 81 : 39853 - 39871
  • [6] Spam SMS filtering based on text features and supervised machine learning techniques
    Abid, Muhammad Adeel
    Ullah, Saleem
    Siddique, Muhammad Abubakar
    Mushtaq, Muhammad Faheem
    Aljedaani, Wajdi
    Rustam, Furqan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 39853 - 39871
  • [7] Review Spam Detection using Machine Learning
    Radovanovic, Drasko
    Krstajic, Boza
    [J]. 2018 23RD INTERNATIONAL SCIENTIFIC-PROFESSIONAL CONFERENCE ON INFORMATION TECHNOLOGY (IT), 2018,
  • [8] Spam Detection Using Machine Learning in R
    Kumari, K. R. Vidya
    Kavitha, C. R.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGIES (ICCNCT 2018), 2019, 15 : 55 - 64
  • [9] SMS Spam Filtering using Supervised Machine Learning Algorithms
    Navaney, Pavas
    Dubey, Gaurav
    Rana, Ajay
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE CONFLUENCE 2018 ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING, 2018, : 43 - 48
  • [10] An Intelligent Spam Email Filtering Approach Using a Learning Classifier System
    Al-Ajeli, Ahmed
    Al-Shamery, Eman S.
    Alubady, Raaid
    [J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2022, 22 (03) : 233 - 244