Devising and Detecting Phishing Emails Using Large Language Models

Cited by: 9
Authors
Heiding, Fredrik [1, 2]
Schneier, Bruce [3]
Vishwanath, Arun [4]
Bernstein, Jeremy [5]
Park, Peter S. [5]
Affiliations
[1] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[2] KTH Royal Inst Technol, S-11428 Stockholm, Sweden
[3] Harvard Univ, Harvard Kennedy Sch, Cambridge, MA 02138 USA
[4] Avant Res Grp, Buffalo, NY 14214 USA
[5] MIT, Cambridge, MA 02139 USA
Keywords
Phishing; large language models; social engineering; artificial intelligence
DOI
10.1109/ACCESS.2024.3375882
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
AI programs built on large language models make it possible to automatically create phishing emails from a few data points about a user. The V-Triad is a set of rules for manually designing phishing emails that exploit our cognitive heuristics and biases. In this study, we compare the performance of phishing emails created automatically by GPT-4 with those created manually using the V-Triad. We also combine GPT-4 with the V-Triad to assess their combined potential. A fourth group, exposed to generic phishing emails, served as our control group. We use a red teaming approach, simulating attackers and emailing 112 participants recruited for the study. The control group emails received a click-through rate of 19-28%, the GPT-generated emails 30-44%, emails generated by the V-Triad 69-79%, and emails generated by GPT and the V-Triad 43-81%. Each participant was asked to explain why they pressed or did not press a link in the email. These answers often contradict each other, highlighting the importance of personal differences. Next, we used four popular large language models (GPT, Claude, PaLM, and LLaMA) to detect the intention of phishing emails and compared the results to human detection. The language models demonstrated a strong ability to detect malicious intent, even in non-obvious phishing emails. They sometimes surpassed human detection, although they were often slightly less accurate than humans. Finally, we analyze the economic aspects of AI-enabled phishing attacks, showing how large language models increase the incentives for phishing and spear phishing by reducing their costs.
Pages: 42131-42146
Number of pages: 16
Related papers
50 in total
  • [41] Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark
    Hoelscher-Obermaier, Jason
    Persson, Julia H.
    Kran, Esben
    Konstas, Ioannis
    Barez, Fazl
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11548 - 11559
  • [42] The Performance of Sequential Deep Learning Models in Detecting Phishing Websites Using Contextual Features of URLs
    Gopali, Saroj
    Namin, Akbar S.
    Abri, Faranak
    Jones, Keith S.
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1064 - 1066
  • [43] Chasing the authoritarian spectre: Detecting authoritarian discourse with large language models
    Mochtak, Michal
    EUROPEAN JOURNAL OF POLITICAL RESEARCH, 2024,
  • [44] Detecting Data Races in OpenMP with Deep Learning and Large Language Models
    Alsofyani, May
    Wang, Liqiang
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 96 - 103
  • [45] Limits of Detecting Text Generated by Large-Scale Language Models
    Varshney, Lav R.
    Keskar, Nitish Shirish
    Socher, Richard
    2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2020,
  • [46] The use of large language models in detecting Chinese ultrasound report errors
    Yan, Yuqi
    Wang, Kai
    Feng, Bojian
    Yao, Jincao
    Jiang, Tian
    Jin, Zhiyan
    Zheng, Yin
    Zhou, Yahan
    Chen, Chen
    Sui, Lin
    Chen, Xiayi
    Du, Yanhong
    Yang, Jie
    Pan, Qianmeng
    Zhou, Lingyan
    Wang, Vicky Yang
    Liang, Ping
    Xu, Dong
    NPJ DIGITAL MEDICINE, 2025, 8 (01):
  • [47] Detecting Spear Phishing Attacks Using Machine Learning
    Regulagadda, Ramakrishna
    Krishna, M. Sai
    Prasanth, G.
    Sumalatha, V
    Ramesh, Y. Sai
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 1457 - 1459
  • [48] Detecting phishing websites using machine learning technique
    Dutta, Ashit Kumar
    PLOS ONE, 2021, 16 (10):
  • [49] Detecting Ambiguous Phishing Certificates using Machine Learning
    Homayoun, Sajad
    Hageman, Kaspar
    Afzal-Houshmand, Sam
    36TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2022), 2022, : 1 - 6
  • [50] Using Large Language Models in Business Processes
    Grisold, Thomas
    vom Brocke, Jan
    Kratsch, Wolfgang
    Mendling, Jan
    Vidgof, Maxim
    BUSINESS PROCESS MANAGEMENT, BPM 2023, 2023, 14159 : XXIX - XXXI