The shaky foundations of large language models and foundation models for electronic health records

被引:50
|
作者
Wornow, Michael [1 ]
Xu, Yizhe [2 ]
Thapa, Rahul [2 ]
Patel, Birju [2 ]
Steinberg, Ethan [1 ]
Fleming, Scott [2 ]
Pfeffer, Michael A. [2 ,3 ]
Fries, Jason [2 ]
Shah, Nigam H. [2 ,3 ,4 ,5 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Stanford Univ, Ctr Biomed Informat Res, Sch Med, Stanford, CA USA
[3] Stanford Hlth Care, Technol & Digital Serv, Palo Alto, CA USA
[4] Stanford Univ, Dept Med, Sch Med, Stanford, CA USA
[5] Stanford Univ, Clin Excellence Res Ctr, Sch Med, Stanford, CA USA
关键词
BIAS;
D O I
10.1038/s41746-023-00879-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The success of foundation models such as ChatGPT and AlphaFold has spurred significant interest in building similar models for electronic medical records (EMRs) to improve patient care and hospital operations. However, recent hype has obscured critical gaps in our understanding of these models' capabilities. In this narrative review, we examine 84 foundation models trained on non-imaging EMR data (i.e., clinical text and/or structured data) and create a taxonomy delineating their architectures, training data, and potential use cases. We find that most models are trained on small, narrowly-scoped clinical datasets (e.g., MIMIC-III) or broad, public biomedical corpora (e.g., PubMed) and are evaluated on tasks that do not provide meaningful insights on their usefulness to health systems. Considering these findings, we propose an improved evaluation framework for measuring the benefits of clinical foundation models that is more closely grounded to metrics that matter in healthcare.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Large language models: a new chapter in digital health
    不详
    [J]. LANCET DIGITAL HEALTH, 2024, 6 (01): : e1 - e1
  • [32] The Opportunities and Risks of Large Language Models in Mental Health
    Lawrence, Hannah R.
    Schneider, Renee A.
    Rubin, Susan B.
    Mataric, Maja J.
    McDuff, Daniel J.
    Bell, Megan Jones
    [J]. JMIR MENTAL HEALTH, 2024, 11
  • [33] Large language models: a new chapter in digital health
    The Lancet Digital Health
    [J]. The Lancet Digital Health, 2024, 6 (01):
  • [34] Large Language Models
    Cerf, Vinton G.
    [J]. COMMUNICATIONS OF THE ACM, 2023, 66 (08) : 7 - 7
  • [35] Large Language Models
    Vargas, Diego Collarana
    Katsamanis, Nassos
    [J]. ERCIM NEWS, 2024, (136): : 12 - 13
  • [36] Identification of Dietary Supplement Use from Electronic Health Records Using Transformer-based Language Models
    Zhou, Sicheng
    Schutte, Dalton
    Xing, Aiwen
    Chen, Jiyang
    Wolfson, Julian
    He, Zhe
    Yu, Fang
    Zhang, Rui
    [J]. 2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 513 - 514
  • [37] Privacy Mental Models of Electronic Health Records: A German Case Study
    Panskus, Rebecca
    Ninow, Max
    Fahl, Sascha
    Marky, Karola
    [J]. PROCEEDINGS OF THE NINETEENTH SYMPOSIUM ON USABLE PRIVACY AND SECURITY, SOUPS 2023, 2023, : 525 - 542
  • [38] Federated learning of predictive models from federated Electronic Health Records
    Brisimi, Theodora S.
    Chen, Ruidi
    Mela, Theofanie
    Olshevsky, Alex
    Paschalidis, Ioannis Ch.
    Shi, Wei
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 112 : 59 - 67
  • [39] Interpretation Attacks and Defenses on Predictive Models Using Electronic Health Records
    Razmi, Fereshteh
    Lou, Jian
    Hong, Yuan
    Xiong, Li
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 446 - 461
  • [40] Comorbidity network analysis using graphical models for electronic health records
    Zhao, Bo
    Huepenbecker, Sarah
    Zhu, Gen
    Rajan, Suja S.
    Fujimoto, Kayo
    Luo, Xi
    [J]. FRONTIERS IN BIG DATA, 2023, 6