Predictive analytics for spam email classification using machine learning techniques

被引:8
|
作者
Kumar, Pradeep [1 ]
机构
[1] Department of Computer Science and Information Technology, Maulana Azad National Urdu University, Hyderabad, Telangana, India
关键词
Predictive analytics;
D O I
10.1504/IJCAT.2020.111844
中图分类号
学科分类号
摘要
Automated text classification is the most widely used approach to manage an enormous amount of unstructured text data in digital forms, which is continuously increasing across the globe. Machine learning techniques are applied for automatic email filtering effectively to detect the spam mail and prevent them from delivering into the user's inbox. This paper used Logistic regression, k-Nearest Neighbours (k-NN), Naive Bayes, Decision Trees, AdaBoost, ANNs, and SVMs for spam email classification. All the classifiers are learned, and the performance measured in terms of precision, recall, and accuracy using a set of systematic experiments conducted on the Spambase data set extracted from the UCI Machine Learning Repository. The effectiveness of each model is empirically illustrated to find a better and viable alternative model. The quantitative performance analysis of supervised and hybrid learning techniques is presented in detail. Experimental results indicate that ensemble methods outperform in terms of accuracy compared with other methods applied. Copyright © 2020 Inderscience Enterprises Ltd.
引用
收藏
页码:282 / 296
相关论文
共 50 条
  • [1] Predictive analytics for spam email classification using machine learning techniques
    Kumar, Pradeep
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 64 (03) : 282 - 296
  • [2] A Proposed Data Science Approach for Email Spam Classification using Machine Learning Techniques
    Alurkar, Aakash Atul
    Ranade, Sourabh Bharat
    Joshi, Shreeya Vijay
    Ranade, Siddhesh Sanjay
    Sonewar, Piyush A.
    Mahalle, Parikshit N.
    Deshpande, Arvind V.
    [J]. 2017 JOINT 13TH CTTE AND 10TH CMI CONFERENCE ON INTERNET OF THINGS - BUSINESS MODELS, USERS, AND NETWORKS, 2017,
  • [3] A Comprehensive Review on Email Spam Classification using Machine Learning Algorithms
    Raza, Mansoor
    Jayasinghe, Nathali Dilshani
    Muslam, Muhana Magboul Ali
    [J]. 35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 327 - 332
  • [4] Email Spam Classification and Detection using Various Machine Learning Classifiers
    Saraswathi, N.
    Pradeep, S.
    Sathiyavathi, V.
    Sabitha, K.
    Kambattan, K. Rajesh
    [J]. 2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [5] Spam Email Detection Using Deep Learning Techniques
    AbdulNabi, Isra'a
    Yaseen, Qussai
    [J]. 12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 853 - 858
  • [6] Trustworthiness of Spam Email Addresses using Machine Learning
    Janez-Martino, Francisco
    Alaiz-Rodriguez, Rocio
    Gonzalez-Castro, Victor
    Fidalgo, Eduardo
    [J]. PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
  • [7] Email Spam Classification by Support Vector Machine
    Singh, Manmohan
    Pamula, Rajendra
    Shekhar, Shudhanshu Kumar
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 878 - 882
  • [8] Classification of Phishing Email Using Word Embedding and Machine Learning Techniques
    Somesha, M.
    Pais, Alwyn R.
    [J]. Journal of Cyber Security and Mobility, 2022, 11 (03): : 279 - 320
  • [9] Design of a Machine Learning Based Predictive Analytics System for Spam Problem
    Yuksel, A. S.
    Cankaya, S. F.
    Uncu, I. S.
    [J]. ACTA PHYSICA POLONICA A, 2017, 132 (03) : 500 - 504
  • [10] Healthcare predictive analytics using machine learning and deep learning techniques: a survey
    Mohammed Badawy
    Nagy Ramadan
    Hesham Ahmed Hefny
    [J]. Journal of Electrical Systems and Information Technology, 10 (1)