Email Classification and Forensics Analysis using Machine Learning

Cited by: 10
Authors
Hina, Maryam [1]
Ali, Mohsan [2]
Javed, Abdul Rehman [3]
Srivastava, Gautam [4]
Gadekallu, Thippa Reddy [5]
Jalil, Zunera [3]
Affiliations
[1] Air Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Air Univ, Natl Ctr Cyber Secur, Islamabad, Pakistan
[3] Air Univ, Dept Cyber Secur, Islamabad, Pakistan
[4] Brandon Univ, Dept Math & Comp Sci, Brandon, MB R7A 6A9, Canada
[5] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
Keywords
Digital Forensics; Machine Learning; Email Forensics; Fraud Detection; Crime Investigation
DOI
10.1109/SWC50871.2021.00093
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Email has long been used as a reliable, secure, and formal mode of communication. With fast and secure communication technologies, reliance on email has increased as well. The massive growth in email data has made managing emails a major challenge. So far, emails can be classified and grouped by sender, size, and date. However, there is a need to detect and classify emails based on their content. Several approaches have been used in the past for content-based classification of emails as spam or non-spam. In this paper, we propose a multi-label email classification approach to organize emails. An efficient classification method is proposed for forensic investigation of massive email data (e.g., a disk image of an email server). This method would help investigators in email-related crime investigations. A comparative study of machine learning algorithms identified Logistic Regression as the method that achieves the highest accuracy compared with Naive Bayes, Stochastic Gradient Descent, Random Forest, and Support Vector Machine. Experiments conducted on benchmark data sets showed that Logistic Regression performs best, with an accuracy of 91.9% using bi-gram features.
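As a rough illustration of what "Logistic Regression with bi-gram features" can mean in practice, here is a minimal sketch in plain Python. It is not the authors' implementation: it assumes whitespace tokenization, raw bi-gram counts, a binary (two-class) simplification of the classification task, and plain stochastic gradient descent. The function names and the four toy emails are hypothetical.

```python
import math
from collections import Counter


def bigrams(text):
    """Count word bi-grams in a lowercased, whitespace-tokenized text."""
    tokens = text.lower().split()
    return Counter(zip(tokens, tokens[1:]))


def train_logreg(samples, labels, epochs=200, lr=0.5):
    """Fit binary logistic regression on sparse bi-gram counts with SGD.

    Assumption: labels are 0 or 1; the multi-class setting described in
    the abstract would need one-vs-rest or a softmax extension.
    """
    weights, bias = {}, 0.0
    feats = [bigrams(s) for s in samples]
    for _ in range(epochs):
        for x, y in zip(feats, labels):
            z = bias + sum(weights.get(f, 0.0) * v for f, v in x.items())
            p = 1.0 / (1.0 + math.exp(-z))
            err = p - y  # gradient of the log loss with respect to z
            bias -= lr * err
            for f, v in x.items():
                weights[f] = weights.get(f, 0.0) - lr * err * v
    return weights, bias


def predict_proba(model, text):
    """Return the estimated probability that `text` belongs to class 1."""
    weights, bias = model
    z = bias + sum(weights.get(f, 0.0) * v for f, v in bigrams(text).items())
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical toy data: 1 = suspicious, 0 = normal.
emails = [
    "win a free prize now",
    "claim your free prize today",
    "meeting agenda for monday",
    "please review the meeting agenda",
]
labels = [1, 1, 0, 0]
model = train_logreg(emails, labels)
```

Calling `predict_proba(model, "free prize inside")` should give a value above 0.5, because the bi-gram ("free", "prize") appears only in the suspicious examples. Unseen bi-grams contribute nothing to the score.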
Pages: 630-635
Number of pages: 6
Related papers
50 in total
  • [21] Phishing Email Detection Using Machine Learning Techniques
    Alattas, Hussain
    Aljohar, Fay
    Aljunibi, Hawra
    Alweheibi, Muneera
    Alrashdi, Rawan
    Al Azman, Ghadeer
    Alharby, Abdulrahman
    Nagy, Naya
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (04): : 678 - 685
  • [22] Business Email Classification Using Incremental Subspace Learning
    Li, Min
    Park, Youngja
    Ma, Rui
    Huang, He Yuan
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 625 - 628
  • [23] Trustworthiness of Spam Email Addresses using Machine Learning
    Janez-Martino, Francisco
    Alaiz-Rodriguez, Rocio
    Gonzalez-Castro, Victor
    Fidalgo, Eduardo
    PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
  • [24] Machine learning with digital forensics for attack classification in cloud network environment
    Shaweta Sachdeva
    Aleem Ali
    International Journal of System Assurance Engineering and Management, 2022, 13 : 156 - 165
  • [25] Machine learning with digital forensics for attack classification in cloud network environment
    Sachdeva, Shaweta
    Ali, Aleem
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022, 13 (SUPPL 1) : 156 - 165
  • [26] BULK EMAIL FORENSICS
    Cohen, Fred
    ADVANCES IN DIGITAL FORENSICS V, 2009, 306 : 51 - 67
  • [27] Performance Analysis of Modulation Classification Using Machine learning
    Nisha, G.
    Vijayan, Vishnupriya
    Jose, Renu
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 70 - 74
  • [28] Email Spoofing Detection Using Volatile Memory Forensics
    Iyer, R. Padmavathi
    Atrey, Pradeep K.
    Varshney, Gaurav
    Misra, Manoj
    2017 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2017, : 619 - 625
  • [29] An Email Forensics Analysis Method Based on Social Network Analysis
    Liu, YanHua
    Chen, GuoLong
    Xie, Lili
    2013 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CLOUDCOM-ASIA), 2013, : 563 - 569
  • [30] Network Forensics Analysis of Cyber Attacks on Computer Systems using Machine Learning Techniques
    Yildiz, Firdevs
    Guel, Batuhan
    Ertam, Fatih
    ACTA INFOLOGICA, 2024, 8 (01): : 34 - 50