Large language models to identify social determinants of health in electronic health records

被引:52
|
作者
Guevara, Marco [1 ,2 ]
Chen, Shan [1 ,2 ]
Thomas, Spencer [1 ,2 ,3 ]
Chaunzwa, Tafadzwa L. [1 ,2 ]
Franco, Idalid [2 ]
Kann, Benjamin H. [1 ,2 ]
Moningi, Shalini [2 ]
Qian, Jack M. [1 ,2 ]
Goldstein, Madeleine [4 ]
Harper, Susan [4 ]
Aerts, Hugo J. W. L. [1 ,2 ,5 ,6 ]
Catalano, Paul J. [7 ,8 ]
Savova, Guergana K. [3 ]
Mak, Raymond H. [1 ,2 ]
Bitterman, Danielle S. [1 ,2 ]
机构
[1] Harvard Med Sch, Artificial Intelligence Med AIM Program, Mass Gen Brigham, Boston, MA 02115 USA
[2] Brigham & Womens Hosp, Dana Farber Canc Inst, Dept Radiat Oncol, Boston, MA 02115 USA
[3] Harvard Med Sch, Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA USA
[4] Dana Farber Canc Inst, Adult Resource Off, Boston, MA USA
[5] Maastricht Univ, Radiol & Nucl Med, GROW, Maastricht, Netherlands
[6] Maastricht Univ, CARIM, Maastricht, Netherlands
[7] Dana Farber Canc Inst, Dept Data Sci, Boston, MA USA
[8] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA
基金
欧洲研究理事会;
关键词
ADVERSE CHILDHOOD EXPERIENCES; UNITED-STATES; SUPPORT; MORTALITY; SURVIVAL; WOMEN;
D O I
10.1038/s41746-023-00970-0
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Social determinants of health (SDoH) play a critical role in patient outcomes, yet their documentation is often missing or incomplete in the structured data of electronic health records (EHRs). Large language models (LLMs) could enable high-throughput extraction of SDoH from the EHR to support research and clinical care. However, class imbalance and data limitations present challenges for this sparsely documented yet critical information. Here, we investigated the optimal methods for using LLMs to extract six SDoH categories from narrative text in the EHR: employment, housing, transportation, parental status, relationship, and social support. The best-performing models were fine-tuned Flan-T5 XL for any SDoH mentions (macro-F1 0.71), and Flan-T5 XXL for adverse SDoH mentions (macro-F1 0.70). Adding LLM-generated synthetic data to training varied across models and architecture, but improved the performance of smaller Flan-T5 models (delta F1 + 0.12 to +0.23). Our best-fine-tuned models outperformed zero- and few-shot performance of ChatGPT-family models in the zero- and few-shot setting, except GPT4 with 10-shot prompting for adverse SDoH. Fine-tuned models were less likely than ChatGPT to change their prediction when race/ethnicity and gender descriptors were added to the text, suggesting less algorithmic bias (p < 0.05). Our models identified 93.8% of patients with adverse SDoH, while ICD-10 codes captured 2.0%. These results demonstrate the potential of LLMs in improving real-world evidence on SDoH and assisting in identifying patients who could benefit from resource support.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Large language models for accurate disease detection in electronic health records: the examples of crystal arthropathies
    Burgisser, Nils
    Chalot, Etienne
    Mehouachi, Samia
    Buclin, Clement P.
    Lauper, Kim
    Courvoisier, Delphine S.
    Mongin, Denis
    RMD OPEN, 2024, 10 (04):
  • [22] AUTOMATED IDENTIFICATION OF RECURRENT GASTROINTESTINAL BLEEDING USING ELECTRONIC HEALTH RECORDS AND LARGE LANGUAGE MODELS
    Zheng, Neil S.
    Keloth, Vipina K.
    You, Kisung
    Li, Darrick K.
    Xu, Hua
    Laine, Loren
    Shung, Dennis
    GASTROENTEROLOGY, 2024, 166 (05) : S292 - S292
  • [23] The Transformative Potential of Large Language Models in Mining Electronic Health Records Data: Content Analysis
    Zurita, Amadeo Jesus Wals
    del Rio, Hector Miras
    de Aguirre, Nerea Ugarte Ruiz
    Navarro, Cristina Nebrera
    Jimenez, Maria Rubio
    Carmona, David Munoz
    Sanchez, Carlos Miguez
    JMIR MEDICAL INFORMATICS, 2025, 13
  • [24] Extracting social determinants of health from inpatient electronic medical records using natural language processing
    Martin, Elliot A.
    D'Souza, Adam G.
    Saini, Vineet
    Tang, Karen
    Quan, Hude
    Eastwood, Cathy A.
    JOURNAL OF EPIDEMIOLOGY AND POPULATION HEALTH, 2024, 72 (06):
  • [25] Classifying Unstructured Text in Electronic Health Records for Mental Health Prediction Models: Large Language Model Evaluation Study
    Cardamone, Nicholas C.
    Olfson, Mark
    Schmutte, Timothy
    Ungar, Lyle
    Liu, Tony
    Cullen, Sara W.
    Williams, Nathaniel J.
    Marcus, Steven C.
    JMIR MEDICAL INFORMATICS, 2025, 13
  • [26] Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review
    Rajwal, Swati
    Zhang, Ziyuan
    Chen, Yankai
    Rogers, Hannah
    Sarker, Abeed
    Xiao, Yunyu
    JMIR RESEARCH PROTOCOLS, 2025, 14
  • [27] Natural language processing to identify lupus nephritis phenotype in electronic health records
    Deng, Yu
    Pacheco, Jennifer A.
    Ghosh, Anika
    Chung, Anh
    Mao, Chengsheng
    Smith, Joshua C.
    Zhao, Juan
    Wei, Wei-Qi
    Barnado, April
    Dorn, Chad
    Weng, Chunhua
    Liu, Cong
    Cordon, Adam
    Yu, Jingzhi
    Tedla, Yacob
    Kho, Abel
    Ramsey-Goldman, Rosalind
    Walunas, Theresa
    Luo, Yuan
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 22 (SUPPL 2)
  • [28] Natural language processing to identify lupus nephritis phenotype in electronic health records
    Yu Deng
    Jennifer A. Pacheco
    Anika Ghosh
    Anh Chung
    Chengsheng Mao
    Joshua C. Smith
    Juan Zhao
    Wei-Qi Wei
    April Barnado
    Chad Dorn
    Chunhua Weng
    Cong Liu
    Adam Cordon
    Jingzhi Yu
    Yacob Tedla
    Abel Kho
    Rosalind Ramsey-Goldman
    Theresa Walunas
    Yuan Luo
    BMC Medical Informatics and Decision Making, 22
  • [29] Natural Language Processing to Identify Lupus Nephritis Phenotype in Electronic Health Records
    Deng, Yu
    Pacheco, Jennifer
    Chung, Anh
    Mao, Chengsheng
    Smith, Joshua
    Zhao, Juan
    Wei, Wei-Qi
    Barnado, April
    Weng, Chunhua
    Liu, Cong
    Gordon, Adam
    Yu, Jingzhi
    Tedla, Yacob
    Kho, Abel
    Ramsey-Goldman, Rosalind
    Walunas, Theresa
    Luo, Yuan
    ARTHRITIS & RHEUMATOLOGY, 2021, 73 : 666 - 667
  • [30] Relative accuracy of social and medical determinants of suicide in electronic health records
    Alemi, Farrokh
    Avramovic, Sanja
    Renshaw, Keith D.
    Kanchi, Rania
    Schwartz, Mark
    HEALTH SERVICES RESEARCH, 2020, 55 : 833 - 840