Performance of Large Language Models in Patient Complaint Resolution: Web-Based Cross-Sectional Survey

被引:0
|
作者
Yong, Lorraine Pei Xian [1 ,2 ,3 ]
Tung, Joshua Yi Min [4 ]
Lee, Zi Yao [1 ,2 ,3 ]
Kuan, Win Sen [1 ,2 ,3 ]
Chua, Mui Teng [1 ,2 ,3 ]
机构
[1] Natl Univ Hlth Syst, Natl Univ Hosp, Emergency Med Dept, 5 Lower Kent Ridge Rd, Singapore 119074, Singapore
[2] Natl Univ Singapore, Yong Loo Lin Sch Med, Dept Surg, Singapore, Singapore
[3] Natl Univ Hlth Syst, Alexandra Hosp, Urgent Care Ctr, Singapore, Singapore
[4] Singapore Gen Hosp, Dept Urol, Singapore, Singapore
关键词
ChatGPT; large language models; artificial intelligence; patient complaint; health care complaint; empathy; efficiency; patient satisfaction; resource allocation; SEX-DIFFERENCES; HEALTH; EMPATHY;
D O I
10.2196/56413
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Patient complaints are a perennial challenge faced by health care institutions globally, requiring extensive time and effort from health care workers. Despite these efforts, patient dissatisfaction remains high. Recent studies on the use of large language models (LLMs) such as the GPT models developed by Open AI in the health care sector have shown great promise, with the ability to provide more detailed and empathetic responses as compared to physicians. LLMs could potentially be used in responding to patient complaints to improve patient satisfaction and complaint response time. Objective: This study aims to evaluate the performance of LLMs in addressing patient complaints received by a tertiary healthcare institution, with the goal of enhancing patient satisfaction. Methods: Anonymized patient complaint emails and associated responses from the patient relations department were obtained.ChatGPT-4.0 (Open AI, Inc) was provided with the same complaint email and tasked to generate a response. The complaints and the respective responses were uploaded onto a web-based questionnaire. Respondents were asked to rate both responses on a10-point Likert scale for 4 items: appropriateness, completeness, empathy, and satisfaction. Participants were also asked to choose a preferred response at the end of each scenario. Results: There was a total of 188 respondents, of which 115 (61.2%) were health care workers. A majority of the respondents, including both health care and non-health care workers, preferred replies from Chat GPT (n=164, 87.2% to n=183, 97.3%).GPT-4.0 responses were rated higher in all 4 assessed items with all median scores of 8 (IQR 7-9) compared to human responses(appropriateness 5, IQR 3-7; empathy 4, IQR 3-6; quality 5, IQR 3-6; satisfaction 5, IQR 3-6; P<.001) and had higher average word counts as compared to human responses (238 vs 76 words). Regression analyses showed that a higher word count was a statistically significant predictor of higher score in all 4 items, with every 1-word increment resulting in an increase in scores of between 0.015 and 0.019 (all P<.001). However, on subgroup analysis by authorship, this only held true for responses written by patient relations department staff and not those generated by Chat GPT which received consistently high scores irrespective of response length. Conclusions: This study provides significant evidence supporting the effectiveness of LLMs in resolution of patient complaints. Chat GPT demonstrated superiority in terms of response appropriateness, empathy, quality, and overall satisfaction when compared against actual human responses to patient complaints. Future research can be done to measure the degree of improvement that artificial intelligence generated responses can bring in terms of time savings, cost-effectiveness, patient satisfaction, and stress reduction for the health care system.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Headache in Medical Residents: A Cross-Sectional Web-Based Survey
    de Melo Silva Junior, Mario Luciano
    Melo, Thayanara Silva
    de Sousa Menezes, Neila Clediane
    Valenca, Marcelo Moraes
    Sampaio Rocha-Filho, Pedro Augusto
    [J]. HEADACHE, 2020, 60 (10): : 2320 - 2329
  • [2] Patient Perspectives on Value Dimensions of Lung Cancer Care: Cross-sectional Web-Based Survey
    Varriale, Pasquale
    Mueller, Borna
    Katz, Gregory
    Dallas, Lorraine
    Aguaron, Alfonso
    Azoulai, Marion
    Girard, Nicolas
    [J]. JMIR FORMATIVE RESEARCH, 2023, 7
  • [3] Treatment preferences in fibromyalgia patients: A cross-sectional web-based survey
    Valentini, Elia
    Fetter, Eleonora
    Orbell, Sheina
    [J]. EUROPEAN JOURNAL OF PAIN, 2020, 24 (07) : 1290 - 1300
  • [4] Harassment in the headache field: a global web-based cross-sectional survey
    de Boer, Irene
    Ambrosini, Anna
    Singh, Rashmi B. Halker
    Baykan, Betul
    Buse, Dawn C.
    Tassorelli, Cristina
    Jensen, Rigmor H.
    Pozo-Rosich, Patricia
    Terwindt, Gisela M.
    [J]. CEPHALALGIA, 2023, 43 (08)
  • [5] Patient-reported burden of dry eye disease in the UK: a cross-sectional web-based survey
    Hossain, Parwez
    Siffel, Csaba
    Joseph, Corey
    Meunier, Juliette
    Markowitz, Jessica T.
    Dana, Reza
    [J]. BMJ OPEN, 2021, 11 (03):
  • [6] Perceptions of Quality of Care Among Users of a Web-Based Patient Portal: Cross-sectional Survey Analysis
    Lear, Rachael
    Freise, Lisa
    Kybert, Matthew
    Darzi, Ara
    Neves, Ana Luisa
    Mayer, Erik K.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (11)
  • [7] Present Situation and the Future Development of Web-Based Prenatal Education in China: Cross-sectional Web-Based Survey
    Huang, Xinyu
    Sun, Weiwei
    Wang, Renyu
    Wu, Huailiang
    Yu, Shinning
    Fang, Xuanbi
    Liu, Yiyan
    Akinwunmi, Babatunde
    Huang, Jian
    Ming, Wai-Kit
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (06)
  • [8] The quality of web-based information on hip resurfacing arthroplasty: a cross-sectional survey
    Kwong, Y.
    Kwong, F. N.
    Costa, M. L.
    [J]. HIP INTERNATIONAL, 2006, 16 (04) : 268 - 272
  • [9] The European Board of Interventional Radiology Examination: A Cross-Sectional Web-Based Survey
    Emma Tong
    Muirne Spooner
    Otto Van Delden
    Raman Uberoi
    Mark Sheehan
    Damien C. O’Neill
    Michael Lee
    [J]. CardioVascular and Interventional Radiology, 2018, 41 : 21 - 26
  • [10] The European Board of Interventional Radiology Examination: A Cross-Sectional Web-Based Survey
    Tong, Emma
    Spooner, Muirne
    Van Delden, Otto
    Uberoi, Raman
    Sheehan, Mark
    O'Neill, Damien C.
    Lee, Michael
    [J]. CARDIOVASCULAR AND INTERVENTIONAL RADIOLOGY, 2018, 41 (01) : 21 - 26