Devising and Detecting Phishing Emails Using Large Language Models

被引:9
|
作者
Heiding, Fredrik [1 ,2 ]
Schneier, Bruce [3 ]
Vishwanath, Arun [4 ]
Bernstein, Jeremy [5 ]
Park, Peter S. [5 ]
机构
[1] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[2] KTH Royal Inst Technol, S-11428 Stockholm, Sweden
[3] Harvard Univ, Harvard Kennedy Sch, Cambridge, MA 02138 USA
[4] Avant Res Grp, Buffalo, NY 14214 USA
[5] MIT, Cambridge, MA 02139 USA
关键词
Phishing; large language models; social engineering; artificial intelligence;
D O I
10.1109/ACCESS.2024.3375882
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
AI programs, built using large language models, make it possible to automatically create phishing emails based on a few data points about a user. The V-Triad is a set of rules for manually designing phishing emails to exploit our cognitive heuristics and biases. In this study, we compare the performance of phishing emails created automatically by GPT-4 and manually using the V-Triad. We also combine GPT-4 with the V-Triad to assess their combined potential. A fourth group, exposed to generic phishing emails, was our control group. We use a red teaming approach by simulating attackers and emailing 112 participants recruited for the study. The control group emails received a click-through rate between 19-28%, the GPT-generated emails 30-44%, emails generated by the V-Triad 69-79%, and emails generated by GPT and the V-Triad 43-81%. Each participant was asked to explain why they pressed or did not press a link in the email. These answers often contradict each other, highlighting the importance of personal differences. Next, we used four popular large language models (GPT, Claude, PaLM, and LLaMA) to detect the intention of phishing emails and compare the results to human detection. The language models demonstrated a strong ability to detect malicious intent, even in non-obvious phishing emails. They sometimes surpassed human detection, although often being slightly less accurate than humans. Finally, we analyze of the economic aspects of AI-enabled phishing attacks, showing how large language models increase the incentives of phishing and spear phishing by reducing their costs.
引用
收藏
页码:42131 / 42146
页数:16
相关论文
共 50 条
  • [1] ChatPhishDetector: Detecting Phishing Sites Using Large Language Models
    Koide, Takashi
    Nakano, Hiroki
    Chiba, Daiki
    IEEE ACCESS, 2024, 12 : 154381 - 154400
  • [2] Utilizing Large Language Models with Human Feedback Integration for Generating Dedicated Warning for Phishing Emails
    Nguyen, Quan Hong
    Wu, Tingmin
    Nguyen, Van
    Yuan, Xingliang
    Xue, Jason
    Rudolph, Carsten
    PROCEEDINGS OF THE 2ND ACM WORKSHOP ON SECURE AND TRUSTWORTHY DEEP LEARNING SYSTEMS, SECTL 2024, 2024, : 35 - 46
  • [3] Detecting Phishing Sites Using URLs Collected from Emails
    Wang, Chuan-Sheng
    Hsu, Fu-Hau
    Chen, Shih-Jen
    Hwang, Yan-Ling
    Wu, Min-Hao
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 916 - +
  • [4] An improved transformer-based model for detecting phishing, spam and ham emails: A large language model approach
    Jamal, Suhaima
    Wimmer, Hayden
    Sarker, Iqbal H.
    SECURITY AND PRIVACY, 2024, 7 (05)
  • [5] LMs go Phishing: Adapting Pre-trained Language Models to Detect Phishing Emails
    Misra, Kanishka
    Rayz, Julia Taylor
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 135 - 142
  • [6] Detecting Spear-phishing Emails Based on Authentication
    Wang Xiujuan
    Zhang Chenxi
    Zheng Kangfeng
    Tang Haoyang
    Tao Yuanrui
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 450 - 456
  • [7] A Sender-Centric Approach to Detecting Phishing Emails
    Sanchez, Fernando
    Duan, Zhenhai
    2012 ASE INTERNATIONAL CONFERENCE ON CYBER SECURITY (CYBERSECURITY), 2012, : 32 - 39
  • [8] Generating Phishing Emails Using Graph Database
    Maleki, Nasim
    Ghorbani, Ali A.
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2019, 2019, 11879 : 434 - 449
  • [9] Comprehensive Method for Detecting Phishing Emails Using Correlation-based Analysis and User Participation
    Verma, Rakesh
    El Aassal, Ayman
    PROCEEDINGS OF THE SEVENTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY (CODASPY'17), 2017, : 155 - 157
  • [10] Detecting hallucinations in large language models using semantic entropy
    Farquhar, Sebastian
    Kossen, Jannik
    Kuhn, Lorenz
    Gal, Yarin
    NATURE, 2024, 630 (8017) : 625 - +