Email Classification and Forensics Analysis using Machine Learning

被引:10
|
作者
Hina, Maryam [1 ]
Ali, Mohsan [2 ]
Javed, Abdul Rehman [3 ]
Srivastava, Gautam [4 ]
Gadekallu, Thippa Reddy [5 ]
Jalil, Zunera [3 ]
机构
[1] Air Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Air Univ, Natl Ctr Cyber Secur, Islamabad, Pakistan
[3] Air Univ, Dept Cyber Secur, Islamabad, Pakistan
[4] Brandon Univ, Dept Math & Comp Sci, Brandon, MB R7A 6A9, Canada
[5] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
关键词
Digital Forensics; Machine Learning; Email Forensics; Fraud Detection; Crime Investigation;
D O I
10.1109/SWC50871.2021.00093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emails are being used as a reliable, secure, and formal mode of communication for a long time. With fast and secure communication technologies, reliance on Email has increased as well. The massive increase in email data has led to a big challenge in managing emails. Emails so far can be classified and grouped based on sender, size, and date. However, there is a need to detect and classify emails based on the contents contained therein. Several approaches have been used in the past for content-based classification of emails as Spam or Non-Spam Email. In this paper, we propose a multi-label email classification approach to organize emails. An efficient classification method has been proposed for forensic investigations of massive email data (e.g., a disk image of an email server). This method would help the investigator in Email related crimes investigations. A comparative study of machine learning algorithms identified Logistic Regression as a method that achieves the highest accuracy compared to Naive Bayes, Stochastic Gradient Descent, Random Forest, and Support Vector Machine. Experiments conducted on benchmark data sets depicted that logistic Regression performs best, with an accuracy of 91.9% with bi-gram features.
引用
收藏
页码:630 / 635
页数:6
相关论文
共 50 条
  • [1] Predictive analytics for spam email classification using machine learning techniques
    Kumar P.
    International Journal of Computer Applications in Technology, 2020, 64 (03): : 282 - 296
  • [2] A Comprehensive Review on Email Spam Classification using Machine Learning Algorithms
    Raza, Mansoor
    Jayasinghe, Nathali Dilshani
    Muslam, Muhana Magboul Ali
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 327 - 332
  • [3] Email Spam Classification and Detection using Various Machine Learning Classifiers
    Saraswathi, N.
    Pradeep, S.
    Sathiyavathi, V.
    Sabitha, K.
    Kambattan, K. Rajesh
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [4] Classification of Phishing Email Using Random Forest Machine Learning Technique
    Akinyelu, Andronicus A.
    Adewumi, Aderemi O.
    JOURNAL OF APPLIED MATHEMATICS, 2014,
  • [5] Predictive analytics for spam email classification using machine learning techniques
    Kumar, Pradeep
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 64 (03) : 282 - 296
  • [6] Classification of Phishing Email Using Word Embedding and Machine Learning Techniques
    Somesha M.
    Pais A.R.
    Journal of Cyber Security and Mobility, 2022, 11 (03): : 279 - 320
  • [7] Analysis of Email Communication and Forensics
    Liu Yangyang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET TECHNOLOGY AND SECURITY (ITS 2010), 2010, : 5 - 9
  • [8] An Empirical Study on Email Classification Using Supervised Machine Learning in Real Environments
    Li, Wenjuan
    Meng, Weizhi
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 7438 - 7443
  • [9] Adaptive Machine Learning Approach for Emotional Email Classification
    Karthik, K.
    Ponnusamy, R.
    HUMAN-COMPUTER INTERACTION: TOWARDS MOBILE AND INTELLIGENT INTERACTION ENVIRONMENTS, PT III, 2011, 6763 : 552 - 558
  • [10] A Proposed Data Science Approach for Email Spam Classification using Machine Learning Techniques
    Alurkar, Aakash Atul
    Ranade, Sourabh Bharat
    Joshi, Shreeya Vijay
    Ranade, Siddhesh Sanjay
    Sonewar, Piyush A.
    Mahalle, Parikshit N.
    Deshpande, Arvind V.
    2017 JOINT 13TH CTTE AND 10TH CMI CONFERENCE ON INTERNET OF THINGS - BUSINESS MODELS, USERS, AND NETWORKS, 2017,