Devising and Detecting Phishing Emails Using Large Language Models

被引：9

作者：

Heiding, Fredrik ^{[1
,2
]}

Schneier, Bruce ^{[3
]}

Vishwanath, Arun ^{[4
]}

Bernstein, Jeremy ^{[5
]}

Park, Peter S. ^{[5
]}

机构：

[1] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA

[2] KTH Royal Inst Technol, S-11428 Stockholm, Sweden

[3] Harvard Univ, Harvard Kennedy Sch, Cambridge, MA 02138 USA

[4] Avant Res Grp, Buffalo, NY 14214 USA

[5] MIT, Cambridge, MA 02139 USA

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Phishing; large language models; social engineering; artificial intelligence;

D O I：

10.1109/ACCESS.2024.3375882

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

AI programs, built using large language models, make it possible to automatically create phishing emails based on a few data points about a user. The V-Triad is a set of rules for manually designing phishing emails to exploit our cognitive heuristics and biases. In this study, we compare the performance of phishing emails created automatically by GPT-4 and manually using the V-Triad. We also combine GPT-4 with the V-Triad to assess their combined potential. A fourth group, exposed to generic phishing emails, was our control group. We use a red teaming approach by simulating attackers and emailing 112 participants recruited for the study. The control group emails received a click-through rate between 19-28%, the GPT-generated emails 30-44%, emails generated by the V-Triad 69-79%, and emails generated by GPT and the V-Triad 43-81%. Each participant was asked to explain why they pressed or did not press a link in the email. These answers often contradict each other, highlighting the importance of personal differences. Next, we used four popular large language models (GPT, Claude, PaLM, and LLaMA) to detect the intention of phishing emails and compare the results to human detection. The language models demonstrated a strong ability to detect malicious intent, even in non-obvious phishing emails. They sometimes surpassed human detection, although often being slightly less accurate than humans. Finally, we analyze of the economic aspects of AI-enabled phishing attacks, showing how large language models increase the incentives of phishing and spear phishing by reducing their costs.

引用

页码：42131 / 42146

页数：16

共 50 条

[31] Detecting and monitoring concerns against HPV vaccination on social media using large language models
Rai, Sunny
Kornides, Melanie
Morgan, Jennifer
Kumar, Aman
Cappella, Joseph
Guntuku, Sharath Chandra
SCIENTIFIC REPORTS, 2024, 14 (01):
[32] Prompt Engineering or Fine-Tuning? A Case Study on Phishing Detection with Large Language Models
Trad, Fouad
Chehab, Ali
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (01): : 367 - 384
[33] Detecting Phishing Websites Using Machine Learning
Alswailem, Amani
Alabdullah, Bashayr
Alrumayh, Norah
Alsedrani, Aram
2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
[34] Benchmarking and Evaluating Large Language Models in Phishing Detection for Small and Midsize Enterprises: A Comprehensive Analysis
Zhang, Jun
Wu, Peiqiao
London, Jeffrey
Tenney, Dan
IEEE ACCESS, 2025, 13 : 28335 - 28352
[35] Using large language models in psychology
Demszky, Dorottya
Yang, Diyi
Yeager, David
Bryan, Christopher
Clapper, Margarett
Chandhok, Susannah
Eichstaedt, Johannes
Hecht, Cameron
Jamieson, Jeremy
Johnson, Meghann
Jones, Michaela
Krettek-Cobb, Danielle
Lai, Leslie
Jonesmitchell, Nirel
Ong, Desmond
Dweck, Carol
Gross, James
Pennebaker, James
NATURE REVIEWS PSYCHOLOGY, 2023, 2 (11): : 688 - 701
[36] Using large language models in psychology
Dorottya Demszky
Diyi Yang
David S. Yeager
Christopher J. Bryan
Margarett Clapper
Susannah Chandhok
Johannes C. Eichstaedt
Cameron Hecht
Jeremy Jamieson
Meghann Johnson
Michaela Jones
Danielle Krettek-Cobb
Leslie Lai
Nirel JonesMitchell
Desmond C. Ong
Carol S. Dweck
James J. Gross
James W. Pennebaker
Nature Reviews Psychology, 2023, 2 : 688 - 701
[37] Using large language models wisely
不详
NATURE ASTRONOMY, 2025, 9 (03): : 315 - 315
[38] Detecting Phishing Domains Using Machine Learning
Alnemari, Shouq
Alshammari, Majid
APPLIED SCIENCES-BASEL, 2023, 13 (08):
[39] Detecting Phishing Website Using Machine Learning
Alkawaz, Mohammed Hazim
Steven, Stephanie Joanne
Hajamydeen, Asif Iqbal
2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), 2020, : 111 - 114
[40] Detecting unknown network attacks using language models
Rieck, Konrad
Laskov, Pavel
DETECTION OF INTRUSIONS AND MALWARE & VULNERABILITY ASSESSMENT, PROCEEDINGS, 2006, 4064 : 74 - 90

← 1 2 3 4 5 →