Leveraging large language models for generating responses to patient messages-a subjective analysis

被引:10
|
作者
Liu, Siru [1 ,5 ]
Mccoy, Allison B. [1 ]
Wright, Aileen P. [1 ,2 ]
Carew, Babatunde [3 ]
Genkins, Julian Z. [4 ]
Huang, Sean S. [1 ,2 ]
Peterson, Josh F. [1 ,2 ]
Steitz, Bryan [1 ]
Wright, Adam [1 ]
机构
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37212 USA
[2] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37212 USA
[3] Vanderbilt Univ, Med Ctr, Dept Gen Internal Med & Publ Hlth, Nashville, TN 37212 USA
[4] Stanford Univ, Dept Med, Stanford, CA 94304 USA
[5] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, 2525 West End Ave 1475, Nashville, TN 37212 USA
关键词
artificial intelligence; clinical decision support; large language model; patient portal; primary care;
D O I
10.1093/jamia/ocae052
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective This study aimed to develop and assess the performance of fine-tuned large language models for generating responses to patient messages sent via an electronic health record patient portal.Materials and Methods Utilizing a dataset of messages and responses extracted from the patient portal at a large academic medical center, we developed a model (CLAIR-Short) based on a pre-trained large language model (LLaMA-65B). In addition, we used the OpenAI API to update physician responses from an open-source dataset into a format with informative paragraphs that offered patient education while emphasizing empathy and professionalism. By combining with this dataset, we further fine-tuned our model (CLAIR-Long). To evaluate fine-tuned models, we used 10 representative patient portal questions in primary care to generate responses. We asked primary care physicians to review generated responses from our models and ChatGPT and rated them for empathy, responsiveness, accuracy, and usefulness.Results The dataset consisted of 499 794 pairs of patient messages and corresponding responses from the patient portal, with 5000 patient messages and ChatGPT-updated responses from an online platform. Four primary care physicians participated in the survey. CLAIR-Short exhibited the ability to generate concise responses similar to provider's responses. CLAIR-Long responses provided increased patient educational content compared to CLAIR-Short and were rated similarly to ChatGPT's responses, receiving positive evaluations for responsiveness, empathy, and accuracy, while receiving a neutral rating for usefulness.Conclusion This subjective analysis suggests that leveraging large language models to generate responses to patient messages demonstrates significant potential in facilitating communication between patients and healthcare providers.
引用
收藏
页码:1367 / 1379
页数:13
相关论文
共 50 条
  • [1] Prompt engineering on leveraging large language models in generating response to InBasket messages
    Yan, Sherry
    Knapp, Wendi
    Leong, Andrew
    Kadkhodazadeh, Sarira
    Das, Souvik
    Jones, Veena G.
    Clark, Robert
    Grattendick, David
    Chen, Kevin
    Hladik, Lisa
    Fagan, Lawrence
    Chan, Albert
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2263 - 2270
  • [2] Leveraging Large Language Models for Generating Personalized Care Recommendations in Dementia
    Hu, Hsiang-Wei
    Lin, Yu-chun
    Chia, Chang-Hung
    Chuang, Ethan
    Yang, Cheng Ru
    2024 IEEE INTERNATIONAL WORKSHOP ON ELECTROMAGNETICS: APPLICATIONS AND STUDENT INNOVATION COMPETITION, IWEM 2024, 2024,
  • [3] Large Language Model Responses to Adolescent Patient and Proxy Messages
    Tse, Gabriel
    Zahedivash, Aydin
    Anoshiravani, Arash
    Carlson, Jennifer
    Haberkorn, William
    Morse, Keith E.
    JAMA PEDIATRICS, 2025, 179 (01) : 93 - 94
  • [4] Leveraging Large Language Models for Automated Dialogue Analysis
    Finch, Sarah E.
    Paek, Ellie S.
    Choi, Jinho D.
    24TH MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2023, 2023, : 202 - 215
  • [5] Leveraging large language models for data analysis automation
    Jansen, Jacqueline A.
    Manukyan, Artur
    Al Khoury, Nour
    Akalin, Altuna
    PLOS ONE, 2025, 20 (02):
  • [6] Towards Generating High-Quality Knowledge Graphs by Leveraging Large Language Models
    Ezzabady, Morteza Kamaladdini
    Ieng, Frederic
    Khorashadizadeh, Hanieh
    Benamara, Farah
    Groppe, Sven
    Sahri, Soror
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 455 - 469
  • [7] Leveraging Large Language Models for Generating Mobile Sensing Strategies in Human Behavior Modeling
    Gao, Nan
    Yu, Zhuolei
    Xu, Yue
    Yu, Chun
    Wang, Yuntao
    Salim, Flora D.
    Shi, Yuanchun
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 729 - 735
  • [8] Leveraging large language models in dermatology
    Matin, Rubeta N.
    Linos, Eleni
    Rajan, Neil
    BRITISH JOURNAL OF DERMATOLOGY, 2023, 189 (03) : 253 - 254
  • [9] Leveraging Large Language Models for Analysis of Student Course Feedback
    Wang, Zixuan
    Denny, Paul
    Leinonen, Juho
    Luxton-Reilly, Andrew
    PROCEEDINGS OF THE 16TH ANNUAL ACM INDIA COMPUTE CONFERENCE, COMPUTE 2023, 2023, : 76 - 79
  • [10] Leveraging Large Language Models for Efficient Failure Analysis in Game Development
    Marini, Leonardo
    Gisslen, Linus
    Sestini, Alessandro
    2024 IEEE CONFERENCE ON GAMES, COG 2024, 2024,