Leveraging large language models for generating responses to patient messages-a subjective analysis

Cited by: 10

Authors:

Liu, Siru [1,5]

Mccoy, Allison B. [1]

Wright, Aileen P. [1,2]

Carew, Babatunde [3]

Genkins, Julian Z. [4]

Huang, Sean S. [1,2]

Peterson, Josh F. [1,2]

Steitz, Bryan [1]

Wright, Adam [1]

Affiliations:

[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37212 USA

[2] Vanderbilt Univ, Med Ctr, Dept Med, Nashville, TN 37212 USA

[3] Vanderbilt Univ, Med Ctr, Dept Gen Internal Med & Publ Hlth, Nashville, TN 37212 USA

[4] Stanford Univ, Dept Med, Stanford, CA 94304 USA

[5] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, 2525 West End Ave 1475, Nashville, TN 37212 USA

Source:

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION | 2024 / Vol. 31 / Issue 06

Keywords:

artificial intelligence; clinical decision support; large language model; patient portal; primary care

DOI:

10.1093/jamia/ocae052

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Objective: This study aimed to develop and assess the performance of fine-tuned large language models for generating responses to patient messages sent via an electronic health record patient portal.

Materials and Methods: Utilizing a dataset of messages and responses extracted from the patient portal at a large academic medical center, we developed a model (CLAIR-Short) based on a pre-trained large language model (LLaMA-65B). In addition, we used the OpenAI API to update physician responses from an open-source dataset into a format with informative paragraphs that offered patient education while emphasizing empathy and professionalism. By combining the patient portal data with this dataset, we further fine-tuned our model (CLAIR-Long). To evaluate the fine-tuned models, we used 10 representative patient portal questions in primary care to generate responses. We asked primary care physicians to review the responses generated by our models and ChatGPT and to rate them for empathy, responsiveness, accuracy, and usefulness.

Results: The dataset consisted of 499 794 pairs of patient messages and corresponding responses from the patient portal, with 5000 patient messages and ChatGPT-updated responses from an online platform. Four primary care physicians participated in the survey. CLAIR-Short exhibited the ability to generate concise responses similar to providers' responses. CLAIR-Long responses provided more patient educational content than CLAIR-Short responses and were rated similarly to ChatGPT's responses, receiving positive evaluations for responsiveness, empathy, and accuracy, and a neutral rating for usefulness.

Conclusion: This subjective analysis suggests that leveraging large language models to generate responses to patient messages demonstrates significant potential in facilitating communication between patients and healthcare providers.


Pages: 1367-1379

Page count: 13
