Medical large language models are susceptible to targeted misinformation attacks

被引:3
|
作者
Han, Tianyu [1 ]
Nebelung, Sven [1 ]
Khader, Firas [1 ]
Wang, Tianci [1 ]
Mueller-Franzes, Gustav [1 ]
Kuhl, Christiane [1 ]
Foersch, Sebastian [2 ]
Kleesiek, Jens [3 ]
Haarburger, Christoph [4 ]
Bressem, Keno K. [5 ,6 ,7 ,8 ]
Kather, Jakob Nikolas [9 ,10 ,11 ]
Truhn, Daniel [1 ]
机构
[1] Univ Hosp Aachen, Dept Diag & Intervent Radiol, Aachen, Germany
[2] Univ Med Ctr Johannes Gutenberg, Inst Pathol, Mainz, Germany
[3] Univ Med Essen, Inst AI Med, Essen, Germany
[4] Ocumeda GmbH, Munich, Germany
[5] Charite Univ Med Berlin, Dept Radiol, Berlin, Germany
[6] Free Univ Berlin, Berlin, Germany
[7] Humboldt Univ, Berlin, Germany
[8] Charite Univ Med Berlin, Berlin Inst Hlth, Berlin, Germany
[9] Tech Univ Dresden, Else Kroener Fresenius Ctr Digital Hlth EKFZ, Dresden, Germany
[10] Univ Hosp Dresden, Dept Med 1, Dresden, Germany
[11] Univ Hosp Heidelberg, Natl Ctr Tumor Dis NCT, Med Oncol, Heidelberg, Germany
来源
NPJ DIGITAL MEDICINE | 2024年 / 7卷 / 01期
基金
美国国家卫生研究院; 欧洲研究理事会;
关键词
All Open Access; Gold;
D O I
10.1038/s41746-024-01282-7
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Large language models (LLMs) have broad medical knowledge and can reason about medical information across many domains, holding promising potential for diverse medical applications in the near future. In this study, we demonstrate a concerning vulnerability of LLMs in medicine. Through targeted manipulation of just 1.1% of the weights of the LLM, we can deliberately inject incorrect biomedical facts. The erroneous information is then propagated in the model's output while maintaining performance on other biomedical tasks. We validate our findings in a set of 1025 incorrect biomedical facts. This peculiar susceptibility raises serious security and trustworthiness concerns for the application of LLMs in healthcare settings. It accentuates the need for robust protective measures, thorough verification mechanisms, and stringent management of access to these models, ensuring their reliable and safe use in medical practice.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] TF-Attack: Transferable and fast adversarial attacks on large language models
    Li, Zelin
    Chen, Kehai
    Liu, Lemao
    Bai, Xuefeng
    Yang, Mingming
    Xiang, Yang
    Zhang, Min
    KNOWLEDGE-BASED SYSTEMS, 2025, 312
  • [42] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
    Zhang, Zhexin
    Yang, Junxiao
    Ke, Pei
    Mi, Fei
    Wang, Hongning
    Huang, Minlie
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 8865 - 8887
  • [43] A Comprehensive Overview of Backdoor Attacks in Large Language Models Within Communication Networks
    Yang, Haomiao
    Xiang, Kunlan
    Ge, Mengyu
    Li, Hongwei
    Lu, Rongxing
    Yu, Shui
    IEEE NETWORK, 2024, 38 (06): : 211 - 218
  • [44] Embracing Large Language Models for Medical Applications: Opportunities and Challenges
    Karabacak, Mert
    Margetis, Konstantinos
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [45] Large language models (ChatGPT) in medical education: Embrace or abjure?
    Luke, Nathasha
    Taneja, Reshma
    Ban, Kenneth
    Samarasekera, Dujeepa
    Yap, Celestial T.
    ASIA PACIFIC SCHOLAR, 2023, 8 (04): : 50 - 52
  • [46] Large language models for generating medical examinations: systematic review
    Artsi, Yaara
    Sorin, Vera
    Konen, Eli
    Glicksberg, Benjamin S.
    Nadkarni, Girish
    Klang, Eyal
    BMC MEDICAL EDUCATION, 2024, 24 (01)
  • [47] Leveraging foundation and large language models in medical artificial intelligence
    Wong, Io Nam
    Monteiro, Olivia
    Baptista-Hon, Daniel T.
    Wang, Kai
    Lu, Wenyang
    Sun, Zhuo
    Nie, Sheng
    Yin, Yun
    CHINESE MEDICAL JOURNAL, 2024, 137 (21) : 2529 - 2539
  • [48] A systematic review of large language models and their implications in medical education
    Lucas, Harrison C.
    Upperman, Jeffrey S.
    Robinson, Jamie R.
    MEDICAL EDUCATION, 2024, 58 (11) : 1276 - 1285
  • [49] Leveraging foundation and large language models in medical artificial intelligence
    Wong Io Nam
    Monteiro Olivia
    BaptistaHon Daniel T
    Wang Kai
    Lu Wenyang
    Sun Zhuo
    Nie Sheng
    Yin Yun
    中华医学杂志英文版, 2024, 137 (21)
  • [50] Enhancing the assessment of large language models in medical information generation
    Leiwa, Aher K.
    Lhusseiny, Bdelrahman M.
    OPHTHALMOLOGY RETINA, 2024, 8 (05): : e15 - e15