Evaluating spam filters and Stylometric Detection of AI-generated phishing emails

被引:0
|
作者
Opara, Chidimma [1 ]
Modesti, Paolo [1 ]
Golightly, Lewis [1 ]
机构
[1] Teesside Univ, Dept Comp & Games, Middlesbrough TS1 3BX, England
关键词
AI-generated phishing email; Phishing detection; Stylometric analysis; Large Language Models (LLMs); Machine learning; Cybersecurity;
D O I
10.1016/j.eswa.2025.127044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advanced architecture of Large Language Models (LLMs) has revolutionised natural language processing, enabling the creation of text that convincingly mimics legitimate human communication, including phishing emails. As AI-generated phishing emails become increasingly sophisticated, a critical question arises: How effectively can current email systems and detection mechanisms identify these threats? This study addresses this issue by analysing 63 AI-generated phishing emails created using GPT-4o. It evaluates the effectiveness of major email services, Gmail, Outlook, and Yahoo, in filtering these malicious communications. The findings reveal that Gmail and Outlook allowed more AI-generated phishing emails to bypass their filters compared to Yahoo, highlighting vulnerabilities in existing email filtering systems. To mitigate these challenges, we applied 60 stylometric features across four machine learning models: Logistic Regression, Support Vector Machine, Random Forest, and XGBoost. Among these, XGBoost demonstrated superior performance, achieving 96% accuracy and an AUC score of 99%. Key features such as imperative verb count, clause density, and first- person pronoun usage were instrumental to the model's success. The dataset of AI-generated phishing emails is publicly available on Kaggle to foster further research.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Integrating AI into clinical education: evaluating general practice trainees' proficiency in distinguishing AI-generated hallucinations and impacting factors
    Zhou, Jiacheng
    Zhang, Jintao
    Wan, Rongrong
    Cui, Xiaochuan
    Liu, Qiyu
    Guo, Hua
    Shi, Xiaofen
    Fu, Bingbing
    Meng, Jia
    Yue, Bo
    Zhang, Yunyun
    Zhang, Zhiyong
    BMC MEDICAL EDUCATION, 2025, 25 (01)
  • [42] Navigating the Landscape of AI-Generated Text Detection: Issues and Solutions for Upholding Academic Integrity
    Gupta, Varun
    Gupta, Chetna
    COMPUTER, 2024, 57 (11) : 118 - 123
  • [43] The Imitation Game revisited: A comprehensive survey on recent advances in AI-generated text detection
    Yang, Zhiwei
    Feng, Zhengjie
    Huo, Rongxin
    Lin, Huiru
    Zheng, Hanghan
    Nie, Ruichi
    Chen, Hongrui
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [44] DETECTION OF AI-GENERATED IMAGES: A MIXED METHODS STUDY ON AGE-RELATED DIFFERENCES
    Velazquez, Enilda
    Flores-Cruz, Gabriela
    Roque, Nelson
    INNOVATION IN AGING, 2024, 8 : 1301 - 1301
  • [45] Towards Fair Detection of AI-Generated Essays in Large-Scale Writing Assessments
    Jiang, Yang
    Hao, Jiangang
    Fauss, Michael
    Li, Chen
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, 2024, 2151 : 317 - 324
  • [46] Survey on AI-Generated Plagiarism Detection: The Impact of Large Language Models on Academic Integrity
    Pudasaini, Shushanta
    Miralles-Pechuan, Luis
    Lillis, David
    Llorens Salvador, Marisa
    JOURNAL OF ACADEMIC ETHICS, 2024,
  • [47] Investigating generative AI models and detection techniques: impacts of tokenization and dataset size on identification of AI-generated text
    Hua, Haowei
    Yao, Co-Jiayu
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [48] Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images
    Aziz, Memoona
    Rehman, Umair
    Danish, Muhammad Umair
    Grolinger, Katarina
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2025, 55 (02) : 223 - 233
  • [49] Evaluating AI-Generated Questions: A Mixed-Methods Analysis Using Question Data and Student Perceptions
    Van Campenhout, Rachel
    Hubertz, Martha
    Johnson, Benny G.
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, 2022, 13355 : 344 - 353
  • [50] Evaluating diagnostic content of AI-generated chest radiography: A multi-center visual Turing test
    Myong, Youho
    Yoon, Dan
    Kim, Byeong Soo
    Kim, Young Gyun
    Sim, Yongsik
    Lee, Suji
    Yoon, Jiyoung
    Cho, Minwoo
    Kim, Sungwan
    PLOS ONE, 2023, 18 (04):