Leveraging large language models for generating responses to patient messages-a subjective analysis

Cited by: 10
Authors
Liu, Siru [1, 5]
McCoy, Allison B. [1]
Wright, Aileen P. [1, 2]
Carew, Babatunde [3]
Genkins, Julian Z. [4]
Huang, Sean S. [1, 2]
Peterson, Josh F. [1, 2]
Steitz, Bryan [1]
Wright, Adam [1]
Affiliations
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37212 USA
[2] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37212 USA
[3] Vanderbilt Univ, Med Ctr, Dept Gen Internal Med & Publ Hlth, Nashville, TN 37212 USA
[4] Stanford Univ, Dept Med, Stanford, CA 94304 USA
[5] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, 2525 West End Ave 1475, Nashville, TN 37212 USA
Keywords
artificial intelligence; clinical decision support; large language model; patient portal; primary care
DOI
10.1093/jamia/ocae052
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: This study aimed to develop and assess the performance of fine-tuned large language models for generating responses to patient messages sent via an electronic health record patient portal.
Materials and Methods: Using a dataset of messages and responses extracted from the patient portal at a large academic medical center, we developed a model (CLAIR-Short) based on a pre-trained large language model (LLaMA-65B). In addition, we used the OpenAI API to rewrite physician responses from an open-source dataset into informative paragraphs that offered patient education while emphasizing empathy and professionalism. Combining this dataset with the portal data, we further fine-tuned our model (CLAIR-Long). To evaluate the fine-tuned models, we used 10 representative primary care patient portal questions to generate responses. We asked primary care physicians to review the responses generated by our models and by ChatGPT and to rate them for empathy, responsiveness, accuracy, and usefulness.
Results: The dataset consisted of 499 794 pairs of patient messages and corresponding responses from the patient portal, along with 5000 patient messages and ChatGPT-updated responses from an online platform. Four primary care physicians participated in the survey. CLAIR-Short generated concise responses similar to providers' responses. CLAIR-Long responses provided more patient educational content than CLAIR-Short responses and were rated similarly to ChatGPT's responses, receiving positive evaluations for responsiveness, empathy, and accuracy and a neutral rating for usefulness.
Conclusion: This subjective analysis suggests that leveraging large language models to generate responses to patient messages demonstrates significant potential in facilitating communication between patients and healthcare providers.
Pages: 1367-1379
Number of pages: 13