Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

被引:0
|
作者
Patel, Het [1 ]
Reiman, Umair [1 ]
Iqbal, Farkhund [2 ]
机构
[1] Western Univ, Dept Comp Sci, London, ON, Canada
[2] Zayed Univ, Coll Technol Innovat, Dubai, U Arab Emirates
关键词
Phishing Email Detection; Large Language Models (LLMs); General Pretrained Transformer (GPT); Bidirectional Encoder Representations from Transformers (BERT); Natural Processing Language (NPL); Social Engineering;
D O I
10.1109/HSI61632.2024.10613528
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phishing, a prevalent cybercrime tactic for decades, remains a significant threat in today's digital world. By leveraging clever social engineering elements and modern technology, cybercrime targets many individuals, businesses, and organizations to exploit trust and security. These cyberattackers are often disguised in many trustworthy forms to appear as legitimate sources. By cleverly using psychological elements like urgency, fear, social proof, and other manipulative strategies, phishers can lure individuals into revealing sensitive and personalized information. Building on this pervasive issue within modern technology, this paper will aim to analyze the effectiveness of 15 Large Language Models (LLMs) in detecting phishing attempts, specifically focusing on a randomized set of "419 Scam" emails. The objective is to determine which LLMs can accurately detect phishing emails by analyzing a text file containing email metadata based on predefined criteria. The experiment concluded that the following models, ChatGPT 3.5, GPT-3.5-Turbo-Instruct, and ChatGPT, were the most effective in detecting phishing emails.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Evaluating the Application of Large Language Models to Generate Feedback in Programming Education
    Jacobs, Sven
    Jaschke, Steffen
    [J]. 2024 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE, EDUCON 2024, 2024,
  • [42] LMs go Phishing: Adapting Pre-trained Language Models to Detect Phishing Emails
    Misra, Kanishka
    Rayz, Julia Taylor
    [J]. 2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 135 - 142
  • [43] Assessing the efficacy of large language models in spinal health information dissemination
    McLean, Aaron Lawson
    Senft, Christian
    [J]. BRAIN AND SPINE, 2024, 4
  • [44] Evaluating the Diagnostic Performance of Large Language Models on Complex Multimodal Medical Cases
    Chiu, Wan Hang Keith
    Ko, Wei Sum Koel
    Cho, William Chi Shing
    Hui, Sin Yu Joanne
    Chan, Wing Chi Lawrence
    Kuo, Michael D.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [45] Evaluating Open-Domain Question Answering in the Era of Large Language Models
    Kamalloo, Ehsan
    Dziri, Nouha
    Clarke, Charles L. A.
    Rafiei, Davood
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5591 - 5606
  • [46] Invited Paper: VerilogEval: Evaluating Large Language Models for Verilog Code Generation
    Liu, Mingjie
    Pinckney, Nathaniel
    Khailany, Brucek
    Ren, Haoxing
    [J]. 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [47] Evaluating Large Language Models in Process Mining: Capabilities, Benchmarks, and Evaluation Strategies
    Berti, Alessandro
    Kourani, Humam
    Haefke, Hannes
    Li, Chiao-Yun
    Schuster, Daniel
    [J]. ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, BPMDS 2024, EMMSAD 2024, 2024, 511 : 13 - 21
  • [48] Evaluating the Adaptability of Large Language Models for Knowledge-aware Question and Answering
    Thakkar, Jay
    Kolekar, Suresh
    Gite, Shilpa
    Pradhan, Biswajeet
    Alamri, Abdullah
    [J]. INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
  • [49] Evaluating the Accuracy and Utility of Large Language Models in Answering Common Contraception Questions
    Patel, Anisha V.
    Jasani, Sona
    Alashqar, Abdelrahman
    Panakam, Aisvarya
    Amin, Kanhai
    Sheth, Sangini S.
    [J]. OBSTETRICS AND GYNECOLOGY, 2024, 143 (5S): : 13S - 13S
  • [50] Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers
    Subbiah, Melanie
    Zhang, Sean
    Chilton, Lydia B.
    Mckeown, Kathleen
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 1290 - 1310